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Description 

Technical Reld 

s The present invention relates to blood coagulation factors in general, and more specifically, to the 

expression of proteins having biological activity for blood coagulation. 

Background Art 

JO Blood coagulation is a process consisting of a complex interaction of various blood components or 
factors which eventually gives rise to a fibrin clot. Generally, the blood components which participate in 
what has been referred to as the coagulation "cascade" are proenzymes or zymogens, enzymatically 
inactive proteins which are converted to proteolytic enzymes by the action of an activator, itself an activated 
clotting factor. Coagulation factors which have undergone such a conversion are generally referred to as 

75 "activated factors," and are designated by the addition of a lower case postscript "a" (e.g.. Vila). 

There are two separate systems which can promote blood clotting and thereby participate in normal 
haemostasis. These systems have been referred to as the intrinsic and the extrinsic coagulation pathways. 
The intrinsic pathway refers to those reactions which lead to thrombin formation through utilization of factors 
present only in plasma. An intermediate event in the intrinsic pathway is the activation of Factor IX to Factor 

20 IXa. a reaction catalyzed by Factor Xla and calcium ions. Factor IXa then participates in the activation of 
Factor X in the presence of Factor Villa, phospholipid and calcium ions. The extrinsic pathway involves 
plasma factors as well as components present in tissue extracts. Factor VII. one of the proenzymes referred 
to above, participates in the extrinsic pathway of blood coagulation by converting (upon its activation to Vila) 
Factor X to Xa in the presence of tissue factor and calcium ions. Factor Xa in turn then converts 

25 prothrombin to thrombin in the presence of Factor Va, calcium ions and phospholipid. Because the 
activation of Factor X to Factor Xa is an event shared by t>oth the intrinsic and extrinsic pathways. Factor 
Vila can be used for the treatment of patients with deficiencies or inhibitors of Factor VIII (Thomas, U. S. 
Patent 4,382,083). There is also some evidence to suggest that Factor Vila may participate in the intrinsic 
pathway as well (Zur and Nemerson. J. Biol. Chem. 253: 2203-2209, 1978) by playing a role in the 

30 activation of Factor IX 

Experimental analysis has revealed that human Factor VII is a single-chain glycoprotein with a 
molecular weight of approximately 50.000 daltons. In this form, the factor circulates in the blood as an 
inactive zymogen. Activation of Factor VII to Vila may be catalyzed by several different plasma proteases, 
such as Factor Xlla. Activation of Factor VII results in the formation of two polypeptide chains, a heavy 

35 chain (Mf = 28.000) and a light chain (Mr = 17,000). held together by at least one disulfide bond. Factor 
VII may also be activated to Vila in vitro, for example, by the method disclosed by Thomas in U.S. Patent 
No. 4,456.591. 

Factor IX circulates in the blood as a single-chain precursor of molecular weight 57,000 and is 
converted to an active serine protease (Factor IXa) upon cleavage by Factor Xla. Factor IXa consists of a 

40 light chain and a heavy chain of molecular weights 1 6,000 and 29.000. respectively. 

Current treatment practices for patients having coagulation disorders (e.g.. deficiencies of Factor VllI 
and IX) generally involve replacement therapy with cryoprecipitate or other fractions of human plasma 
containing enriched levels of a particular factor. These preF>arations have heretofore been obtained from 
pooled human plasma, although the preparation of cryoprecipitates requires the use of a relatively large 

45 amount of human plasma as starting material. 

Therapeutic uses of Factor VII exist in the treatment of individuals exhibiting a deficiency in Factor VII. 
as well as Factor VIII and Factor IX deficient populations, and individuals with Von Willebrand*s disease. 
More specifically, individuals receiving Factors Vlll and IX in replacement therapy frequently develop 
antibodies to these proteins. Continuing treatment is exceedingly difficult because of the presence of these 

50 antitjodies. Patients experiencing this problem are normally treated with an activated prothrombin complex 
known to consist of a mixture of active and Inactive clotting enzymes, including Factor Vila. Further, recent 
studies indicate that small amounts (40-50 micrograms) of injected Factor Vila are effective in controlling 
serious on-going bleeding episodes in Factor Vlll deficient patients who have high levels of antibody in their 
blood (Hedner and Kisiel, J. Clin. Invest. 71: 1836-1841. 1983). 

55 Due to the diverse sources of the plasma used in the preparation of cryoprecipitates, it is difficult to test 
th preparations to ensure that they are free of viral contamination. For instance, essentially all recipients of 
cryoprecipitate show a positive test for hepatitis. Recent reports hav also indicated that some hemophiliacs 
receiving cryopr cipitate have developed acquir d immune deficiency syndrom (AIDS). In addition, the 
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purification of larg amounts of these factors is extrennely difficult and expensive. 

Consequently, there exists a need in the art for a method of producing relatively large quantities of pure 
preparations of Factor Vila and Factor IX. The present invention fulfills this need through the use of 
recombinant DNA technology, successfully eliminating the problem of viral contamination and, at the same 
5 time, providing a consistent and homogenous source of active Factor Vila to treat Factor VII and Factor IX 
deficient patients and individuals with Von Willebrand's disease. 

Disclosure of the Invention 



10 Briefly stated, the present invention discloses a DNA construct containing a nucleotide sequence 
encoding human factor VII having an amino acid sequence as shown in Rgure 1 b. The nucleotide sequence 
may comprise a first nucleotide sequence derived from a cDNA or a genomic clone of Factor VII joined to a 
second nucleotide sequence positioned downstream of the first sequence, said second nucleotide^ sequence 
derived from a cDNA clone of of Factor VII. The joined sequences code for a protein which upon activration 

T5 has Factor VII a biological activity for blood coagulation. Further, the first nucleotide sequence may also 
encode a leader peptide and may also include a double-stranded oligonucleotide. 

In addition, the present invention discloses recombinant plasmids capable of integration in mammalian 
host cell DNA. One of the plasmids includes a promoter followed downstream by a set of RNA splice sites, 
the RNA splice sites being followed downstream by a nucleotide sequence which codes for Factor VII. The 

20 sequence codes for a protein which upon activation has Factor Vila biological activity for blood coagulation. 
The nucleotide sequence is then followed downstream by a polyadenylation signal. 

Similar to the recombinant plasmid noted above, the present invention also discloses a second plasmid 
which includes a promoter followed downstream by a set of RNA splice sites, the RNA splice sites being 
followed downstream by a nucleotide sequence which codes at least partially for Factor IX. The nucleotide 

25 sequence comprises a first nucleotide sequence which encodes a calcium binding domain joined to a 
second nucleotide sequence positioned downstream of the first sequence. The second nucleotide sequence 
encodes a catalytic domain for the serine protease activity of Factor IX. The joined sequences code for a 
protein having substantially the same biological activity for blood coagulation as Factor IX. The nucleotide 
sequence is then followed downstream by a polyadenylation signal. 

30 A third aspect of the invention discloses mammalian cells stably transfected to produce a protein having 
susbtantially the same biological activity, upon activation, as Factor Vila. The cells are transfected with a 
DNA construct containing a nucleotide sequence which codes for Factor VII. The sequence codes for a 
protein which, upon activation, has Factor Vila biological activity for blood coagulation. 

The present invention further provides for a method of producing a protein having biological activity for 

35 blood coagulation mediated by Factor Vila through establishing a mammalian host cell which contains a 
DNA construct containing a nucleotide sequence which codes for Factor VIL The sequence codes for a 
protein which, upon activation, has Factor Vila biological activity for blood coagulation. Subsequently, the 
mammalian host is grown in an appropriate medium containing vitamin K and the protein product encoded 
by the DNA construct and produced by the mammalian host cell is isolated. The protein product is then 

40 activated to generate Factor Vila. 

Yet another aspect of the present invention discloses a DNA construct comprising a DNA sequence 
encoding Factor VII having an amino acid sequence as shown in Rgure 1b. In a preferred embodiment, the 
DNA sequence comprises the cDNA sequence of Figure lb from bp 36 to bp 1433. In another preferred 
embodiment, the DNA sequence comprises the cDNA sequence of Rgure 1 b from bp 36 to bp 99. followed 

45 downstream by the sequence from bp 166 to bp 1433. Recombinant plasmids capable of integration in 
mammalian host cell DNA comprising the DNA sequences described immediately above are also disclosed. 

Mammalian cells stably transfected with a recombinant plasmid comprising a DNA sequence encoding 
Factor VII having an amino acid sequence as shown in Figure lb are also disclosed. In preferred 
embodiments, the DNA sequence comprises the cDNA sequence of Rgure lb from bp 36 to bp 1433. or 

50 the cDNA sequence of Rgure 1 b, from bp 36 to bp 99. followed downstream by the sequence from bp 1 66 
to bp 1433. 

A method for producing a protein having biological activity for blood coagulation mediated by Factor 
Vila through establishing a mammalian host cell that contains a DNA construct as described above is also 
disclosed. The mammalian host cell is subsequently grown in an appropriate medium containing vitamin K. 
55 and the protein product encoded by the DNA construct is isolated. The protein product is then activated to 
generate Factor Vila. 

Other aspects of the invention will become evident upon reference to the following detailed description 
and attach d drawings. 
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Brief D scription of the Drawings 



Rgure la illustrates the partial Factor VII cDIMA sequence produced by joining portions of cDNA clone*: 
XVII2115 and XVni923. 

5 Rgure lb illustrates the Factor VII cDNA sequence of XVII2463. Arrows indicate the extent of the 
deletion in the sequence of XVII565. Nunnbers above the sequence designate amino acids. Numbers below 
designate nucleotides. 

Figure 2a illustrates the amino acid sequences of the amino terminal regions of several clotting factors. 
Rgure 2b illustrates a comparison of the amino add sequence of Factor VII obtained from protein 
70 sequencing with that encoded by the cDNA. 

Rgure 3 illustrates the joining of Factor IX leader sequences to a sequence encoding a consensus 
calcium binding domain. 

Rgure 4 illustrates the joining of the Factor IX-consensus sequence hybrids to a partial F^tpr VII cDNA 
to produce an in-frame coding sequence. 
15 Rgure 5 illustrates the construction of a plasmid containing a coding sequence for a Factor IX/Factor VII 
fusion protein. 

Rgure 6 illustrates the expression vector FDWII/pD2. Symbols used are Ad2 MLP. the major late 
promoter from adenovirus 2; L1-3. the adenovirus 2 tripartite leader sequence; 5*ss, 5* splice site* 3'ss, 3* 
splice site' and pA, the late polyadenylation signal from SV40. 
20 Rgure 7 illustrates the nucleotide sequence of a Factor IX^Factor VII cDNA fusion. 

Rgure 8 illustrates expression vector pM7135. Symbols used are E. the SV40 enhancer; ori, the 0-1 
map units Ad 5; pA. the eariy polyadenylation signal from SV40; A, the deletion region of the'pBR322 
"poison" sequences; and other symbols as descrit>ed for Rgure 6. 

Rgure 9 illustrates the subcloning of the 2463bp Factor VI! cDNA. 
25 Rgure TO illustrates the subcloning of the 565bp Factor VII cDNA. 

Rgure 11 illustrates the joining of the 5' end of pVII565 and the 3* portion of pVll2463in pUC18 to 
generate pVII2397. 

Rgure 12 illustrates the construction of the expression plasmids FVII(2463)/pDX and FVII(565 + 2463)- 
/pDX, pA denotes the polyadenylation signal from SV40 in early or late orientation, as described in Example 
30 9. Other symt>ols are as described for Rgure 8. 

Best Mode for Carrying Out the Invention 



Prior to setting forth the invention, it may be helpful to an understanding thereof to set forth definitions 
35 of certain terms to be used hereinafter. 

Complementary DNA or cDNA : A DNA molecule or sequence which has been enzymatically syn- 
thesized from the sequences present in a mRNA template. 

DNA Construct : A DNA molecule, or a clone of such a molecule, either single- or double-stranded, 
which may be isolated in partial form from a naturally occuning gene or which has been modified to contain 
40 segments of DNA which are combined and juxtaposed in a manner which would not otherwise exist in 
nature. 

Plasmid or Vector : A DNA construct containing genetic information which may provide for its replication 
when inserted into a host cell. A plasmid generally contains at least one gene sequence to be expressed in 
the host cell, as well as sequences which facilitate such gene expression, including promoters and 
45 transcription initiation sites. It may be a linear or closed circular molecule. 

Joined: DNA sequences are said to be joined when the 5* and 3* ends of one sequence are attached, 
by phosphodlester bonds, to the 3* and 5* ends, respectively, of an adjacent sequence. Joining may be 
achieved by such methods as ligation of blunt or cohesive termini, by synthesis of joined sequences 
through cDNA cloning, or by removal of intervening sequences through a process of directed mutagenesis. 
50 Leader Peptide : An amino acid sequence which occurs at the amino terminus of some proteins and is 
generally cleaved from the protein during subsequent processing and secretion. Leader peptides comprise 
sequences directing the protein into the secretion patiiway of the cell. As used herein, the term "leader 
peptide" may also mean a portion of the naturally occurring leader peptide. 

Domain : A three-dimensional, self-assembling array of specific amino acids in a protein molecule which 
55 contains all or part of the structural elements necessary for som biological activity of tiiat protein. 

Biological Activity : A function or set of functions performed by a molecule in a biological context (i.e.. in 
an organism or an in vitro facsinnile). Biological activities of proteins may be divided into catalytic and 
effector activities. Catalytic activities of clotting factors generally involve the activation of other factors 
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through the specific cleavage of precursors. Effector activities include specific binding of the biologically 
active molecule to calcium or other small molecules, to macromolecules such as proteins, or to cells. 
Effector activity frequentiy augments, or is essential to. catalytic activity under physiological conditions. 
Catalytic and effector activities may, in 5ome cases, reside within the same domain of a protein. 
5 For Factor Vila, biological activity is characterized by tiie mediation of blood coagulation through the 

extrinsic pathway. Factor Vila activates Factor X to Factor Xa, which in turn converts prothrombin to 
thrombin, thereby initiating the formation of a fibrin clot. Because the activation of Factor X is common to 
both the extrinsic and intrinsic pathways of blood coagulation, Factor Vila may be used to treat individuals 
severely deficient in the activities of Factor IX, Factor VIII or Von Willebrand Factor. 

10 As noted above, the isolation of Factor VII from human plasma is a time-consuming and expensive 
process since the factor is a rare protein present only at a concentration of approximately 300 micrograms 
per liter of blood. In addition, it is difficult to separate from prothrombin, Factor IX and Factor X and is 
susceptible to proteolytic attack during purification (Kisiel and McMuIlen. ibid). Although single-chain human 
Factor VII has been purified to homogeneity (Kisiel and McMulIen. ibid), the published purification methods 

75 are generally limited by low yield and/or contamination by other coagulation factors. 

Factor VII is produced in the liver and requires vitamin K for its biosyntiiesis. Vitamin K is necessary for 
the formation of specific gamma-cartM)xyglutamic acid residues in the factors. These unusual amino acid 
residues, which are formed by a post-translational modification, bind to calcium ions and are responsible for 
the interaction of the protein with phospholipid vesicles. In addition. Factor VII contains one /3-hydroxyaspar- 

20 tic acid residue which is also formed after the protein has been translated. However, the role of this amino 
acid residue is not known. 

Given the fact that the activity of Factor VII is dependent upon post-ti-anslational modifications involving 
the gamma carboxylation of specific glutamic acid residues, and may also be dependent upon the 
hydroxylation of a specific aspartic acid residue, it is unlikely that an active product could be produced 

25 through the cloning and expression of Factor VII in a microorganism. 

Accordingly, the present invention provides a method of producing a protein having biological activity 
for blood coagulation mediated by Factor Vila using stably transfected mammalian cells. 

As noted above. Factor VII requires vitamin K for its biosyntiiesis. In addition, the plasma proteins 
prothrombin. Factor IX. Factor X. Protein C. and Protein S also require vitamin K for their biosynthesis. The 

30 amino-terminal portions of these proteins, which contain gamma-carboxyglutamic acid residues, are ho- 
mologous in both amino acid sequence and in biological function (Rgure 2a). Further, the carboxy-terminal 
portions of Factor VII. prothrombin. Factor IX. Factor X. and Protein C determine their specific serine 
protease functions. 

Factor VII is a trace plasma protein, and the mRNA encoding Factor VII is believed to be rare. 

35 Consequently, purification of Factor VII from plasma in sufficient quantities to permit extensive sequence 
analysis and characterization remains difficult. Degradation of Factor VII during purification, even in the 
presence of protease inhibitors, was noted by Kisiel and McMuIlen (ibid). Due to these difficulties. Factor VII 
has been pooriy characterized, compared to other more abundant components of the blood coagulation 
system. Indeed, the work of Kisiel and McMuIlen (ibid) yielded sequence information for only 10 residues of 

40 each chain of Factor VII, and in each sequence the identification of two residues was tentative. Partial amino 
acid sequence data for Bovine Factor VII have also been published (DiScipio et ah, ibid). 

The presumed rarity of Factor VII mRNA has contributed to the lack of knowledge of the Factor VII 
gene. The success of conventional cDNA cloning techniques is dependent on a sufficient quantity of mRNA 
for use as a template. Premature termination of reverse transcription results in the production of cDNA 

45 clones lacking the 5* end and this condition is exacerbated by low mRNA levels. Several strategies for 
cDNA cloning of low abundance message have been developed (Maniatis et al.. Molecular Cloning: 
A Laboratory Manual . Cold Spring Harbor Laboratory, 1982), but a lack of knowledge of the amino acid 
sequence of the product of interest makes it impossible to predict the DNA sequence and to design 
appropriate oligonucleotide probes. While it may be relatively straightfoward to obtain a partial cDNA clone 

50 of a gene encoding a rare protein by using these advanced strategies, full-length cDNA clones of genes 
encoding rare proteins such as Factor VII remain exceedingly difficult to obtain. 

In comparison to Factor VII. Factor IX is a relatively abundant protein and the sequence of a cDNA 
clone of the human Factor IX gene is known (Kurachi and Davie. Proc. Nati, Acad. Sci. USA 79: 6461-6464. 
1982; and Anson et al.. EMBO J. 3: 1053-1060. 1984). The structure of the Factor IX g"ine has been 

55 characterized and the amino acid sequence of the protein has been determined on the basis of the known 
nucleotide sequence. Some protein sequence data have also been published for human and bovine Factor 
IX and th sequences analyzed (DiScipio et al., ibid). The amino terminal portion of the protein contains 12 
glutamic acid residues that are converted to -y-carboxyglutamic acid (Gla) residues in the mature protein. 
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Th cleavage sites involved in the activation of Factor IX have also been identified (Karachi and Davie, ibid). 
A sequence at th 5' end of th Factor IX cDNA clone codes for a signal peptide which is typical of those 
found in most secreted proteins (Kurachi and Davie, ibid). The expression of the Factor IX gene through 
recombinant DNA methods has not been previously reported. 

5 Because of the difficulty in obtaining a full-length cDNA clone of the Factor VII gene, three novel 

approaches were adopted to supply the 5* end of the coding sequence, including the region encoding the 
leader peptide. According to the first method, a partial cDNA clone for Factor VII is joined to a fragment 
encoding the leader peptide and 5' portion of Factor IX. This approach is based on the observation that the 
amino-terminal portions of the two molecules are responsible for the calcium binding activities of the 

70 respective proteins and the discovery that the caJcium binding activity of Factor IX can substitute for that of 
Factor VII. The resultant polypeptide retains the biological activity of authentic Factor VII because the 
specific serine protease activities of the coagulation factors reside in the carboxy-terminal regions of the 
molecules. The second approach combines the partial cDNA clone with a DNA sequence encoding the 
leader and amino-terminal regions of Factor VII. The partial cDNA and amino acid sequences' of Factor VII 

1$ disclosed herein enable the screening of a genomic DNA library or cDNA library for clones comprising the 
5* portion of the Factor VII gene. The third approach involves joining the partial cDNA clone to hybrid 
coding sequences comprising a cDNA fragment encoding the leader peptide of Factor IX and a synthetic 
gene segment encoding a consensus calcium binding domain or a predicted amino terminal sequence for 
Factor Vll. The coding sequence for the amino terminus of Factor VII was established through previously 

20 unpublished amino acid sequence data disclosed herein. The consensus sequence was derived from the 
factor Vll data and published sequence data for other vitamin K-dependent plasma proteins. 

Consistent with the approach described above for screening for clones comprising the 5' portion of the 
Factor VII gene, the inventors have been successful In obtaining a full-length, correct cDNA that is suitable 
for expression. 

25 Among the cDNA clones that were generated, a clone designated ''XVII2463" contained the largest 
Factor Vll cDNA insert. It was found to contain the entire coding sequence for Factor Vll. This clone 
included a 35 nucleotide 5' untranslated region. 180 nucleotides coding for a 60 amino acid leader. 1218 
nucleotides coding for the 406 amino acid mature protein, a stop codon, 1026 nucleotides of 3' untranslated 
sequence, and a 20 base poly(A) tail (beginning at position 2463). This cDNA has now been sequenced in 

30 Its entirety on both strands. A comparison of it with two cDNA Inserts isolated eariier from clones XVII2115 
and Win 923, revealed that XVII2463 contains, on a single EcoRI fragment, a Factor Vll cDNA coding for 
Factor Vll leader and mature protein sequences. 

A second clone. XV1I565, was isolated that contained a cDNA insert that was Identical to the cDNA of 
clone XVII2463 from nucleotide 9 to nucleotide 638. except that it lacked nucleotides 100 to 165 (Rgure 

35 lb). In comparing the cDNAs to Factor Vll genomic DNA. the absent sequences correspond precisely to 
one exon-like region. Therefore, two Factor Vll cDNAs have been obtained which appear to reflect 
alternative mRNA splicing events. 

The leader encoded by XVII2463 is exceptionally long (60 amino acids) and has a very different 
hydrophoblcity profile when compared with Factor IX, protein C and prothrombin. This leader contains two 

40 mets. at positions -60 and -26. Initiation most likely beigns at the first met, since a hydrophobic region, 
typical of signal peptides, follows the met at position -60, but not the met at -26. It is interesting that the 
absent sequence in XVII565, which corresponds precisely to an exon-like region in the genomic clone, 
results in a 38 amino acid leader with a hydrophoblcity pattern more analogous to Factor IX. protein C. and 
prothrombin. 

45 Since it was not clear then which, if either, of the leaders described above was authentic, an additional 
approach was initiated in an effort to analyze the 5* end sequence. Briefly, this approach included the 
construction and screening of a human genomic DNA library, and the identification of genomic clones 
comprising Factor VII gene sequences. The 5* portion of the genomic sequence was subseqently joined to 
the cDNA to construct a full-length clone. 

50 In an additional constarct. a 5* Factor VII cDNA fragment of XVII565 containing all of the leader and 29 
amino acids of the mature coding sequence was llgated to a fragment of the cDNA of XVII2463 (containing 
the remainder of the mature protein and 3'-untranslated sequences). This "565-2463" sequence encodes a 
full-length Factor Vll cDNA sequence as a single EcoRI fragment. 

The DNA sequences described above are then Inserted into a suitable expression vector which is in 

55 turn used to transfect a mammalian cell line. Expression vectors for use in carrying out th present 
invention will comprise a promoter capable of directing the transcription of a foreign gene In a transfected 
mammalian cell. Viral promoters are preferred due to their efficiency in directing transcription. A particulary 
preferred such promoter is the major late promoter from adenovirus 2. Such xpression vectors will also 
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contain a set of RNA splice sites located downstream from the promoter and upstream from the insertion 
site for a gene encoding a protein having biological activity for blood coagulation. Preferred RNA splice site 
sequences may be obtained from adenovirus and/or immunoglobulin genes. Also contained in the expres- 
sion vectors is a polyadenylation signal, located downstream of the insertion site. Viral polyadenylation 
5 signals are preferred, such as the early or late polyadenylation signals from SV40 or the polyadenylation 
signal from the adenovirus 5:Elb region. In a particularly preferred embodiment, the expression vector also 
comprises a viral leader sequence, such as the adenovirus 2 tripartite leader, located between the promoter 
and the RNA splice sites. Preferred vectors may also include enhancer sequences, such as the SV40 
enhancer. 

10 Cloned DNA sequences may then be introduced into cultured mammalain cells by calcium phosphate 
mediated transfection. (Wtgler et al.. Cell 14: 725, 1978; Corsaro and Pearson, Somatic Cell Genetics 7: 
603, 1981; Graham and Van der Eb. Virology 52: 456. 1973.) A precipitate is formed of the DNA and 
calcium phosphate and this precipitate is applied to the cells. A portion of the cells take up the PNA and 
maintain it inside the cell for several days. A small fraction of the cells (typically 10""*) stably integrate the 

75 DNA into the genome. In order to identify these stable integrants, a gene that confers a selectable 
phenotype (a selectable marker) is generally introduced along with the gene of interest. Preferred selectable 
markers include genes that confer resistance to drugs, such as G-418 and methotrexate. Selectable 
markers may be introduced into the cell on a separate plasmid at the same time as the gene of interest or 
they may be introduced on the same plasmid. A preferred selectable marker is the gene for resistance to 

20 the drug G-418, which is carried on the plasmid pKO-neo (Southem and Berg, J. Mol. Appl. Genet. 1: 327- 
341. 1982). It may also be advantageous to add additional DNA. known as "carrier DNA." to the mixture 
which is introduced into the cells. After the cells have taken up the DNA, they are allowed to grow for a 
period of time,, typically 1-2 days, to begin expressing the gene of interest Drug selection is then applied 
to select for the growth of cells which are expressing the selectable marker in a stable fashion. Clones of 

25 such cells may be screened for expression of the protein of interest. 

Factor VII produced by the transfected cells may be removed from the cell culture media by adsorption 
to barium citrate. Spent medium is mixed with sodium citrate and barium chloride and the precipitatQ 
collected. The precipitated material may then be assayed for the presence of the appropriate clotting factor. 
Further purification may be achieved through immunoadsorption. It is preferred that the immunoadsorption 

30 column comprise a high-specificity monoclonal antibody. Alternatively, purification of the barium citrate 
precipitated material may be accomplished by more conventional biochemical methods or by high- 
performance liquid chromatography. 

Conversion of single-chain Factor Vll to active two-chain Factor Vila may be achieved using Factor Xlla 
as described by Hedner and Kisiel (J. Clin.Hnvest. 71: 1836-1841, 1983), or with other proteases having 

35 trypsin-like specificity (Kisiel and Fujikawa. Behring Inst Mitt. 73: 29-42, 1983). 

In summary, the present invention provides a method forThe production of proteins having the activity 
of Factor Vila using transfected mammalian cells. Gene sequences encoding the specific serine protease 
domain of Factor Vila are isolated from cDNA libraries. Sequences encoding the leader peptide and calcium 
binding domains are isolated from cDNA or genomic libraries or constructed from synthesized 

40 oligonucleotides. The sequences are then joined in an appropriate expression vector so as to encode Factor 
Vll. The resulting vector and a plasmid containing a drug resistance marker are co-transfected into 
appropriate mammalian tissue culture cells. Transfected cells may then be selected by addition of the 
appropriate drug, such as G-418. The protein products are then purified from the cell growth media and 
assayed for biological activity in a blood coagulation assay and for immunological cross-reactivity using 

45 antibodies prepared against authentic human Factor Vll. 

To summarize the examples which follow. Example 1 discloses the cloning of a full-length cDNA 
sequence for Factor Vll. Example 2 discloses a partial amino acid sequence of human Factor Vll. including 
the sequence of approximately 30 amino acids at the amino terminus. Example 3 discloses the construction 
and screening of a human genomic DNA library and the identification of genomic clones comprising Factor 

50 Vll gene sequences. Example 4 discloses the construction of two hybrid gene segments, each comprising a 
cDNA fragment encoding the leader peptide of Factor IX and a synthesized double-stranded fragment 
encoding a consensus calcium binding domain. The hybrid sequences are then joined to partial cDNA 
clones of Factor Vll. Using in vitro mutagenesis, the consensus sequence was then altered to conform to 
the protein sequence data "for Factor Vll. Example 5 describes the construction of a gene sequence 

55 encoding a fusion protein comprising the calcium binding domain of Factor IX and the specific serine 
protease domain of Factor Vll. Exampl 6 describes the construction of the vector pD2 for use in 
expressing proteins having biological activity for blood coagulation in transfected mammalian cells. The 
gene fusion described in Example 5 is expressed using this vector. Example 7 describes the use of the 
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V ctor p02 to express a gene for Factor IX in a transfected mammalian cell line. ExampI 8 describes th 
construction of the vector pM7135, which contains DNA sequences encoding a primary translation product 
comprising the leader sequence of Factor IX fused to Factor Vll. This vector may be used to produce a 
protein having the activity of Factor Vll in a transfected mammalian cell line. Example 9 describes the 
5 expression of Factor Vll using cDNA sequences, and the expression of Factor Vll from a genomic-cDNA 
hybrid sequence. 

The following examples are offered by way of illustration and not by way of limitation. 
EXAMPLES 

10 

Restriction enzymes were obtained from Bethesda Research Laboratqries (BRL) and New England 
Biolabs and were used as directed by the manufacturer, unless otherwise noted. Oligonucleotides were 
synthesized on an Applied Biosystems Model 380 A DNA synthesizer and purified by polyacrylamide gel 
electrophoresis on denaturing gels. E. coli cells were transformed as described by Maniatis et al. (Molecular 
IS Cloning: A Laboratory Manual , Cold Spring Hart>or Laboratory. 1982). Ml 3 and pUC cloning vectors and 
host strains were obtained from BRL Factor Vll was prepared from human plasma as described by Kisiel 
and McMuilen (ibid). 

Example 1 : Cloning of a Partial Factor Vll cONA. 

20 

A. Construction of a human liver cDNA library. 

A cDNA library was prepared from human liver mRNA by the method of Chandra et al.. Proc. Natl. 
Acad. Sci. U.S.A.80 : 1845-1848, 1983. The cDNA preparation was sedimented through an alkaline sucrose 

25 gradient (Monahan et a!.. Biochemistry 15: 223-233. 1976) and fractions containing species of greater than 
atx>ut 1000 nucleotides were pooled. The" first strand preparation was made double-stranded using reverse 
transcriptase (Chandra et al.. 1983). treated with SI nuclease, and the residual staggered ends filled-in 
using DNA Polymerase I (Klenow fragment) in the presence of all four deoxy ribonucleotide triphosphates 
(Maniatis et al.. Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory. 1982). The blunt- 

30 ended cDNA was treated with Eco Rl methylase and ligated to phosphorylated Eco Rl linkers using T* DNA 
ligase (Maniatis et al.. ibid). The ligated DNA preparation was exhaustively digested with Eco Rl to remove 
excess linker sequences and double-stranded DNAs greater than about 1000 base pairs in length were 
purified by neutral sucrose gradient centrifugation (Maniatis et al.. Ibid). Native Xgtll DNA was ligated into 
concatemers, digested to completion with Eco Rt. and the 5* terminal phosphates were removed by 

35 treatment with bacterial alkaline phosphatase. The pooled human liver cDNA was ligated with the phage 
DNA. packaged in vitro (Maniatis et ai.. ibid), and used to infect E. coli Y1088 (Young and Davis. Science, 
222 : 778-782. 19iB3). Approximately 14 x 10^ primary phage "plaques were generated in tiiis library, 
composed of seven libraries of - 2 x 10^ plaques each. Greater than 90% of these were recombinants 
containing human DNA inserts, based on their lack of ;3-gaIactosidase activity and characterization of 20 

40 random clones by Eco Rl digestion followed by agarose gel electrophoresis. The cDNA library, in the form 
of pheige particles, was purified by cesium chloride gradient centrifugation and stored in SM buffer (Maniatis 
et al.. ibid). 

B. Screening of the human liver cDNA library for Factor Vll clones. 

45 

The human liver expression cDNA library described above was screened for specific antigen (Young 
and Davis, ibid) using an ^*®l-labeled monoclonal Factor Vll antibody prepared by the method of Brown et 
^' (*^' Biol. Chem. 225 : 4980-4983. 1980) using purified Factor Vll. Screening of 6 x 10^ phage plaques 
identified one isolate, designated XVII2115. which gave a positive response with the antibody. 
50 The phage clone XVII2115 was tested against two other anti-Factor Vll monoclonal antibodies and a 
rabbit polyclonal antibody to Factor Vll. Isolate XVU2115 gave a positive response to all these anti-Factor Vll 
antibodies. 

DNA was prepared from a plate lysate (Maniatis et al.. pp. 65-66, 1982) of XVH2115. Digestion of this 
DNA with Eco Rl liberated an insert of 2139 base pairs. This insert was subcloned into Ml 3 phage vectors 
55 (Messing. Meth. in Enzymology 101 : 20-77. 1983; and Norrander et al.. Gen 26: 101-106, 1983) for chain 
termination dideoxy DNA sequencing (Sanger et al., Proc. Nati. Acad. Sci. UTS.A. 74: 5463-5476, 1977). 
This cDNA insert contains Pst I sites at positions 214, 839, and 1205 (designated Pstla. Pst lb, and Pst Ic. 
respectively, in Figure la) and a Sma I site located at position 611. The following Ml 3 templates were 
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sequenced: 

1) full-length (2139 bases) Eco RIa Eco Rib fragment in M13mp18 (designated clone F7-1); 

2) Pst la Eco RIa 214 base fragment in M13mp19 (F7-2): 

3) Pst la Pst lb 625 base fragment in Ml3mpl8 {F7-3); 

4) Pst lb Pst la 625 base fragment in Ml3mp18 (F7-7): 

5) Sma I Pst lb 228 base fragment in M13mp10 (F7-8): 

6) Pst lb Pst Ic 366 base fragment in M13mp18 (F7-9): 

7) Pst Ic ^ Pst lb 366 base fragment in M13mp18 (F7-10); 

8) Pst Ic Eco Rib 930 base fragment in M13mp19 (F7-11); and 

9) Eco Rib Eco RIa full-length fragment in M13mpl8 (F7-12) 
(restriction site designations refer to Rgure 1 a). 

The data confirmed the sequence on both strands for 91% of the coding region and 15% of the 3' non- 
coding region and yielded single-stranded sequence information for the remaining 9% of the coding region 
and 85% of the non-coding region. 

Comparison of the amino acid sequence predicted from the cDNA sequence with the known amino acid 
sequence data of Kisiel and McMullen (Thrombosis Research 22: 375. 1981) and the amino acid sequence 
shown below (Example 2) revealed an anomaly which could be explained by the absence of three 
nucleotides in the DNA sequence near position 400. To obtain additional sequence data. XVII2115 was 
digested with Eco Rl, and the Factor VII coding fragment was inserted into pUC 13 (Vieira and f^essing, 
1?* 259-268. 1982: and Messing, ibid) which had been digested with Eco Rl. The resultant 
recombinant plasmid, designated pUCVII2115, was digested with Xba i which cut at position 328. The 
digested sample was divided in half: half was labeled with a^P dCTP and DNA Polymerase I (Klenow 
fragment) (Englund, P.T.. J. Mol. Bio . 66: 209. 1972); the other half was labeled with -y^^p f^jp 
polynucleotide kinase (Chaconas et al., Biochem. Biophys. Res. Comm. 66: 962. 1975). The labeled 
plasmids were then recut with Pst I to yield 113 and 509 base pair fragments."Bbth strands of each of these 
were sequenced by the method of Maxam and Gilbert (Meth. in Enzymology 74: 560. 1980). The 113 base 
pair fragment was sequenced in its entirety and 210 base pairs of the 509 base pair fragment wera 
sequenced. These sequences revealed three additional bases (one C and two G's) which rendered the DNA 
sequence data in agreement with the protein sequence data, indicating that the previous anomalous results 
arose from compressions on the sequencing gel due to secondary structure involving G's and C's. The 
sequence of the last 9% of the coding region on both strands was also confirmed. 

Further analysis of the sequence of the pUCVII2115 insert confirmed that a portion of this cloned 
fragment encoded a sequence of 1 1 amino acids known to be at the cleavage site of Factor VII (Kisiel and 
McMullen. Thrombosis Research 22: 375. 1981). Comparison of this sequence to Factor IX (Davie et al.. 
ibid) and Factor X (Leytus et ah. Proc. Natl. Acad. Sci. U.S.A. 81: 3699-3702. 1984) amino acid sequences 
suggested that the clone contained the sequence for Factor VII beginning at (approximately) nucleotides 
coding for amino acid 36 of the mature Factor VII protein and continuing through approximately 1000 
coding and 1100 noncoding nucleotides and poly A sequence. In addition, it was found that this clone had 
frameshift mutations in the 3* coding portion. 

In order to obtain the correct 3' coding region, all 14 million clones of the seven Xgtll cDNA libraries 
were screened by plaque hybridization (Benton and David. Science 196: 180-181. 1977) with nick-translated 
cDNA of XVil2l15 (Maniatis et al.. pp. 109-112. 1982). 

Seven positive isolates were then screened by dideoxy sequencing of pUC plasmids into which the 
cDNA inserts had been subcloned (Wallace et al.. Gene 16: 21. 1981). The Xgtl 1 clones were digested with 
Eco Rl and the Factor VII fragments were inserted into pUC13 which had been cleaved with Eco Rl. Ail 
except one of these were found to start at a position corresponding to base 212 of the insert in XVII2115: 
the one exception consisted only of 3* non-coding sequence. One of the clones starting at base 212 was 
selected for analysis and was designated clone pUCVII1923. 

Because analysis of pUCVII2115 indicated the presence of frameshift mutations between positions 657 
and 815. pUCVIIl923 was first analyzed in this region by Maxam-Gilbert sequencing. Plasmid pUCVIIl923 
was digested with Nar I (position 779 in Rgure la). The cut DNA was labeled with a^^p ^Qjp ^gj^g q^^^ 
polymerase I (Klenow fragment) and subsequently digested with Ava I (which cleaves at the same site as 
Sma I in Rgure I) and Taq I (site at 1059). yielding a Nar l-Ava ! 166 bp fragment and a 200 bp Nar l-Taq I 
fragment Each of these was sequenced. A C. missing in pUCVII2ll5. was found at position 697 and 
another C, also missing in pUCVII2115. was found at postition 798. 

The rest of the sequence of the coding region of pUCVII1923 was shown to be correct by sequencing 
by the dideoxy method on an Ml 3 subclone of the entire insert of pUCVII1923. The Lac primer 2C87 (Table 
1) was used to sequence from position 212 (Rgure la) to 512; primer ZC218 (CTCTGCCTGCCGAAC) was 
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used to sequ nee from 715 to 1140 and primer ZC217 (ATGAGAAGCGCACGAAG) was used for sequen- 
cing from 720 to 350. Sine th pUCVII2115 insert is corr et from position 13 (positions 1-12 include an 
artificiaJ linker) to 695. and pUCVII1923 is correct from position 212 to the end. the two were spliced 
together to yield a molecule correct from position 13 (Rgure la) to the end. A convenient point utilized for 
5 this splice is the Xba I site at position 328. The sequence of the spliced corrected molecule is shown in 
Rgure la. 

Because a full-length Factor VII clone was difficult to obtain by cDNA cloning, three strategies were 
adopted to provide the missing coding sequence and the necessary upstream processing and signal 
sequences. The first strategy was to obtain the needed sequence from a human genomic OrJA library or 
10 through additional screening of cDNA libraries. The second approach was to synthesize the necessary 5' 
coding sequence, based on the amino acid sequence data for Factor VII (Example 2) and the published 
sequences of the genes encoding vitamin K-dependent clotting factors (Kurachi and Davie, ibid; and Davie 
et al.. ibid), and join this to a portion of the prepro sequence of Factor DC The third strategy relies on the 
functional homology of the amino terminal regions of Factor VII and Factor IX A sequence was constructed 

15 which comprised the coding regions for the leader and amino-terminal portion of Factor IX. This was then 
fused in the proper orientation to the partial Factor VII cDNA. 

In order to obtain DNA sequences that comprise the entire DNA sequence of Factor VII. an attempt was 
made to isolate the remaining 5* DNA sequence. This was accomplished through the utilization of the 5* 
terminal 0.3 kb EcoRI-Xbal fragment from the cDNA insert of XVII2115 to screen a cDNA library comprising 

20 2 X 10^ phage. The library was constructed using poly (A) mRNA from HepG2 cells following an adaptation 
of the method of Gubler and Hoffman (Gene 25: 263-269. 1983). The RNA was reverse transcribed to 
generate first strand cDNA, followed by second strand synthesis using DNA polymerase I and RNase H. 
Following EcoRI methylation and passage over a Sepharose® 6B column, the DNA termini were blunted 
with T* DNA polymerase. EcoRI linkers were added and excess linkers were removed by digestion with 

25 EcoRI and chromatography on Sepharose® CL 28. The DNA in the void volume was collected and ligated 
to Xgtl 1 which had been digested with EcoRI and treated with calf intestinal phosphatase. The DNA was 
packaged and infected into E. coli Y1088. Several positives were detected, and the EcoRI fragments were 
subsequentiy subcloned into M13 phage vectors for dideoxy sequencing using either the Ml 3 universal 
primer or Factor VII specific oligonucleotides. 

30 From these, three new cDNA clones of Factor VII were obtained, and their sequences completely 
determined. The largest of these cDNAs, from a clone designated XVn2463. was found to contain the entire 
coding sequence for Factor VII. This clone included a 35 nucleotide 5* untranslated region. 180 nucleotides 
coding for a 60 amino acid leader, 1218 nucleotides coding for the 406 amino acid mature protein, a stop 
codon. 1026 nucleotides of 3' untranslated sequence, and a 20 base poly(A) tail (beginning at position 

35 2463). This cDNA has now been sequenced in its entirety on both strands. A comparison of it with two 
cDNAs isolated earlier from clones XVII2115 and XVII1923 revealed that clone XVII2463 contains an 
additional 321 nucleotides upstream of the insert in XVII2115 and 519 nucleotides upstream of the insert in 
XVIII 923. The overiapping Factor VII sequences of XVII2463 and these two previous cDNAs agree, except 
that the cDNA of XVII2463 does not contain single base deletions at positions 1005 and 1106. which were 

40 detected in the cDNA of XVII2115. Thus, XVII2463 contains, on a single EcoRI fragment, a Factor VII cDNA 
coding for Factor VII leader and mature protein sequences. 

An additional cDNA, XVH565, was isolated and found to contain 5* terminal Factor VII sequences, but 
was truncated within the coding sequences. Its 5* end maps at nucleotide 9 (Rgure lb). 

When compared with full-length XVII2463. XVII565 was found to lack a sequence corresponding to one 

45 exon-like region within the leader sequence. Bases 100-165 are absent from XVn565 (Rgure lb). The 
absent sequences correspond precisely to one exon-like region by comparison with genomic sequence data 
(as described in Example III). Thus, the XVII565 structure may be a consequence of alternative splicing 
events in the leader sequence. 

The leader encoded by XVI12463 is exceptionally long (60 amino acids) and has a very different 

so hydrophobicity profile when compared with Factor IX. protein C and prothrombin. This leader contains two 
Mets, at positions -60 and -26. Initiation most likely begins at the first Met. since a hydrophobic region, 
typical of signal peptides, follows the Met at position -60. but not the Met at -26. It is interesting that the 
absent sequences in XVII565. which corresponds precisely to an exon-like region in the genomic clone, 
results in a 38 amino acid leader with a hydrophobicity pattern more analogous to the above proteins. 

55 
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Example 2: Animo Acid Sequence of Human Factor VII . 

The elucidation of the amino acid sequence of human Factor VII was desired in order to confinn the 
identrty of putative cDNA clones, substantiate the sequence of Factor VII cDNA. provide information allowing 
5 for the synthesis of specific oligonucleotide probes to screen cDNA and genomic libraries for clones 
containing the 5' sequence, and to construct a synthetic fragment encoding the amino-terminal portion of 
Factor VII. Although limited amino add sequence was provided by Kisiel and McMulIen (ibid), more 
information was needed. 

Purified human Factor Vila (Kisiel and McMullen, ibid) was reduced and carboxymethylated by the 
TO method of Crestfield et al., J. Biol. Chem. 238 : 622. 1963. The light and heavy polypeptide chains of 
carboxymethylated Factor Vila were separated by high-performance liquid chromatography (HPLC) on a 
Micro Pak C18 reverse phase column (Varian Corp.) by generating a gradient of 0.1% TFA in distilled water 
(A) and 0.1% TFA in acetonitrile (B) from 0-40% B in 5 minutes. 40-80% 8 in 25 minutes and 80-100°/, B in 
5 minutes. Approximately 300 picomoles of each peptide chain were analyzed by automated' Edman 

IS degradation using a Gas-Phase Protein Sequencer (Applied Biosystems. Inc.). Eighteen and 29 residues 
were identified at the amino-temnini of the heavy and light polypeptide chains, respectively. The amino- 
terminal sequence of the heavy chain of Factor Vila was consistent with that encoded by cDNA clone 
pUCVn2115 (Rgure 2b). Amino acid residues are designated within Figures 2a and 2b by single letter code 
as follows: A. alanine; C. cysteine: D. aspartic acid; E; glutamic acid; F. phenylalanine; G, glycine; H. 

20 histidine: I. isoleucine; K. lysine; L. leucine; M. methionine; N. asparagine; P. proline; Q, glutamine; R. 
arginine: S, serine; T. threonine; V. valine; W. tryptophan: Y. tyrosine; X indicates an unknown residue and ' 
indicates that the Gla residues (7) were assigned by homology to the structures of other known clotting 
factors and by the absence of any other phenylthiohydantoin-amino acid at those positions. The gaps (-) are 
placed to provide the best alignment among the sequences. In addition, the information indicated that the 

25 amino acids at positions five and nine were lysines and not threonine and arginine. respectively, as 
previously reported (Kisiel and McMullen, ibid). The sequence analyses of the light chain of Factor Vila, 
which originates from the amino-tenminal region of Factor VII. fell short by approximately 6 residues to 
overiap with the structure encoded by the 5* end of cDNA clone pUCVI12115. 

To obtain additional sequence data, two nanomoles of the carboxymethylated light chain were digested 

30 for 12 hours by bovine chymotrypsin (1:100 w/w. enzyme: substrate) in 0.1 M ammonium bicartjonate. pH 
7.8. at 37 -C. The generated fragments were purified by HPLC on a Micro Pak CI 8 reverse phase column 
using the above solvents in a gradient of 0-30% B in 5 minutes, 30-60% B in 25 minutes and 60-80% B in 
10 minutes. Peptides were identified by their U.V. absorption at 220 and 280 nm. Lyophilized peptides 
(approximately 1 nanomole each) were analyzed by Edman degradation. The results (Rgure 2b) confirmed 

35 much of the cDNA sequence in the corresponding region of clone pUCVIl2115. In total. 113 of 152 residues 
(75%) of the light peptide chain of Factor Vila were identified. This sequence is identical to tiiat encoded by 
the known cDNA sti-ucture. Indirect evidence indicates Asn 145 is a site of carbohydrate attachment. 

Example 3: Cloning of the genomic Factor VII sequence . 

40 ' 

As one approach to providing the 5' end sequence lacking from the cDNA. a lambda phage library 
containing human fetal liver DNA (Uwn et al.. Cell 15: 1157-1174) was screened with nick translated Factor 
VII cDNA. A portion of the genomic library was plated on E. coli LE392 (ATCC 33572) to produce a total of 
7.2 X 10^ plaques (Maniatis et al.. ibid. pp. 320-321). The phage plaques were adsorbed from the plates 
45 onto nitrocellulose and hybridized with the ^^p-iabeled cDNA according to the procedure of Benton and 
Davis (Science 196 : 180. 1977), Eight clones were obtained and plaque purified. 

Using a DNA fragment (Eco RIa-Xba I, Rgure 1) from the 5' end of the Factor VII cDNA (XV112115) and 
standard techniques (Maniatis et al.. ibid) those genomic clones containing 5* end sequences were 
identified. These phage were designated 7m1. 7m2 and 7m3. DNA was prepared from these recombinant 
phage and preliminary restriction endonuclease maps derived. Phage 7m1, which gave the strongest 
hybridization signal, was used to generate a more extensive restriction map and to place the Eco Rl-Xba 1 
cDNA sequences on tiiis map by Southern blotting (Southern, J. Mol. Biol. 98: 503, 1975). 

In order to determine if phage 7m1 contained the DNA sequences encoding the amino terminal amino 
acids of the Factor VII protein. Southern blots of phage DNA restriction digests were hybridized with 
mixtures of oligonucleotides whose sequences were deduced from the Factor VII amino terminal amino acid 
sequ nee. Oligonucleotides ZC188, ZC360. and ZC401 (Table 1) were radioactively labeled with T* 
polynucleotide kinase and hybridized to the phage DNA blots at a few degrees centigrade below their Tm 
(Wallace. R. B.. et al.. Nuc. Acids Res . 6: 3543-3557. 1979). The results of Uiis analysis indicated that a 3.7 
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kb Sst I fragment of 7m 1 contained sequences hybridizing to these oligonucleotides. This Sst I fragment 
was subcloned into M13 for DNA sequence analysis. Results obtained using ZC360 as sequencing primer 
identified a region approximately 60 nucleotides in length, which corresponded to the amino-terminai protein 
sequence data. 



Oligonucleotide Sequence 

2C87 TCC CAG TCA CGA CGT 

T G A - ■ ^ 

2 CI 8 8 GCC GGG CTCA CTC CTC CA GAA GGC GTTGG 

C A G 

2C212 GAG CTG CAG GAT CCA TGC AGC GCG TGA ACA TGA 

TCA TGG 

2C213 GAG GCC TGG TGA TTC TGC CAT GAT CAT GTT CAC 

GCG CTG 

2C217 ATG AGA AGC GCA CGA AG 

ZC218 CTC TGC CTG CCG AAC 

2C235 GAT CCA TGC AGC GC 

2C24 9 AGA ACA GCT TTG TTC TTT CA 

2C275 GCC CCC ATT CTG GCA 

ZC286 CCA AAG AGG GCC AAC GCC TTC CTG GAG GAG AGA 

CCT GGG AGC CTG GAG AGA GAG TGT ATT GAG G 

ZC287 AAT ACA CTC TCT CTC CAG GCT CCC AGG TCT CTC 

CTC CAG GAA GGC GTT GGC CCT CTT TGG 

2C288 AGC AGT GTA GCT TCG AGG AGA ACA GAG AGG TTT 

TCG AGG CCA GCG ACG 

ZC289 AAT TCG TCG CTG GCC TCG AAA ACC TCT CTG TTC 

TCC TCG AAG Cl'A CAC TGC TCC 

ZC333 CAG CTT CGT CCT GTC GCT GGC CIC 
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ZC336 



CCT CTT TGG GCC TGG TGA 



2C360 



C C C C G 
CA TC TC TC TT CA 
T T T T A 



ZC401 



CGT AGC GTT CAG GCC CTC GAA GAT CTC GCG GGC 
CTC CTC GAA GCT ACA C 



Since genomic clone 7m 1 was known to contain 7kb of sequences upstream of exon 2. this-dond was 
anticipated to encode Factor VII 5*-untranslated sequences and the leader sequences up to the amino acid 
position -17. In order to confirm that exon 1 was encoded within genomic clone 7 ml. the leader sequence 
information from clones XVII2463 and XVII565 was used to design oligonucleotides ZC528 and ZC529 
(shown below). 



These were used to probe 7m1 DNA, and a sublcone. 7SD. was found that hybridized to both 
oligonucleotides. Exon 1 was determined to be composed of two exonic sequences: exon la. which 
hybridized to ZC528 (corresponding to nucleotides 1 to 30 in XVII2463). and exon 1 b. which hybridized to 
2C528 (corresponding to nucleotides 119 to 148 in XVII2463). The intron sequences flanking both exons la 
and lb have been sequenced: la contains a consensus splice donor sequence at the 3' end of the exon, 
and lb is flanked on each terminus with a consensus splice acceptor (upstream of lb) or donor 
(downstream of lb) sequence. The position of exon la within genomic clone 7m 1 has been precisely 
mapped, while that of exon lb has been mapped within a defined region. Exon lb sequences are present in 
XVII2463, while XVII565 appears to be derived from RNA spliced between exon la and exon 2, looping out 
the 1 b exonic sequence. 

A variety of 7ml subclones in pUC and Ml 3 vectors were prepared to facilitate sequencing the 
remaining exons. Appropriate oligonucleotides designed from the cDNA sequences, which correspond to 
exons 1 through 7, were used to sequence all but the last exon. The genomic sequence corresponds 
exactly to the cDNA sequences through these regions. In addition, the intron/exon boundaries for exons 1-7 
have been determined, and most are now precisely mapped within clone 7m 1. The intron sizes and 
positions within the Factor VII gene are listed in Table 2. 



2C528 



5' 



TCA ACA GGC AGG GGC AGC ACT GCA GAG ATT^ ' 



2C529 



5' 



TTC CAC GGC ATG TCC CGT GTT TCT CCT CCT^ ' 
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Intron/Exon Junctions in the Factor VII Gene 



Intron Amino Acid Position Intron Size (Kb) 



A 


-39 


>0.2 


B 


-17 


>1.0 


C 


37/38 


1 .92 


D 


46 


0.068 


E 


84 




F 


131 


*1 


G 


167/168 


0.56 


H 


209 


1.31 



Phage 7m 1 was known to lack the Factor VII 3*-tenminus. which includes exon 8. In order to obtain 
these sequences, a 12-13 kt>-enriched Bam HI library in XL47.1 (Loenen and Brammer, Gene 10: 249. 1980; 
Maniatis. et al., ibid .), derived from human dermal primary fibroblast cells, was probed with two nick- 
translated Factor Vil cDNA PstI fragments (corresponding to sequences In exon 7, and to 3'-untranslated 
sequences). A clone, designated 7DC1. was detected by both probes. Subsequent restriction endonuclease 
and Southern blot analysis established that clone 7DC1 overlaps with, and extends approximately 3 Kb 
beyond the terminus of. clone 7m1. and that it contains exon 8. The 3.9 Kb pCbal-BamHI) fragment from 
7DC1 DNA containing exon 8 was subcloned into Ml 3. and sequence analysis was performed using 
oligonucleotides complementary to its 5* and 3' termini. The entire exon sequence is present in this clone. 

Example 4: Factor IX-Factor VM Hybrid Genes Containing a Synthesized Coding Sequ ence. 

A. Construction of a hybrid Factor IX leader-synthetic Factor VII 5* coding sequence. 

The second altemative for obtaining the 5* coding sequence for Factor VII was synthesis of an 
appropriate double-stranded fragment, using a nucleotide sequence predicted on the basis of the amino 
terminal amino acid sequence of Factor VII, the amino acid sequences of other vitamin K-dependent clotting 
factors, and the known nucleotide sequences of other vitamin K-dependent clotting factor genes (Kurachi 
and Davie, ibid: Anson et al.. EMBO J. 3: 1053-1060.1984; and Davie et al.. ibid). In order to provide the 
necessary secretion and processing signals for secretion of a mature Factor VII analog, this synthetic 
fragment (the consensus sequence) was joined to one of two leader sequences derived from a Factor IX 
cDNA clone. This strategy is outiined in Rgure 3. 

A cDNA coding for human Factor IX was obtained from a library made with mRNA from human liver 
(Kurachi and Davie, ibid). The Factor IX sequence was isolated from the pBR322 vector by digestion with 
Pst I and was inserted into the Pst I site of pUC13. This plasmid was designated FIX-pUCl3. In order to 
remove the G-rich region which was present at the 5* end of the Factor IX insert as a result of cDNA 
cloning, a synthetic oligonucleotide adaptor was substituted for the 5* end of the cloned fragment 
Oligonucleotides ZC212 and 2C213 (Table 1) were synthesized and annealed to generate a 22 base pair 
overlap, the fragment ends filled in and cut with appropriate restriction endonucleases. and tiie resulting 
fragment was joined to the Factor IX sequence. 

To construct the adaptor, 100 pmoles each of ZC212 and ZC213 were lyophilized and resuspended in 
10 ul of lOx kinase/Iigase buffer (600 mM Tris pH 8.0, 100 mM MgCb. 100 nM DTT) plus 86 ul H2O. The 
annealing reaction was run at 65*C for 10 minutes, the mixture was slowly cooled to room temperature and 
put on ice. To this mixture was added 4 ul of 2.5 mM dNTP mix and 1 ul (8 units) T4 DNA polymerase. The 
reaction was allowed to proceed 45 minutes at 14*C. Ten ul of 5 M NH+OAc was then added and the DNA 
was xtracted once with phenol/CHCb. twice with CHCb, and was precipitated with ethanol. The DNA was 
centrifuged and resusp nded in 100 ul medium salt buffer (Maniatis et al.. ibid, p. 100), digested with 9 
units Pst I and 8 units Cfo I. and extracted as abov . 
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The modified Factor IX sequence was then constructed by combining 0.16 pmoles of the synthetic Pst 
l-Cfo I adaptor fragment, 0.14 pmoles of a 1.4 kb Cfo l-Bam HI Factor IX fragment from FIX-pUC13. and 
0.14 pmoles of a 2.7 kb Bam Hl-Pst I pUC13 vector fragment in a 20 ul reaction containing 60 mM Tris-HCI 
pH 7.5. 10 mM MgCb, 10 mM DTT, and 0.9 units T4 ligase. The reaction was incubated for 3 h at room 
5 temperature and used to transform competent E. coli JM83 (Messing, Recombinant DNA Technical Bulletin. 
NIH Publication No. 79-99, 2. No. 2, 43-48. l979).~The cells were plated with 50 ul of 2% X-gal (5 bromo-4^ 
chloro-3 idolyl-/3-D-galactoside) on L-broth containing 40 ug/ml. ampicillin and incubated at 37*C overnight. 
White colonies were picked onto another plate containing ampicillin and grown at 37 'C overnight. The 
colonies were blotted on Whatman 540 paper and the paper prepared for hybridization according to the 

10 method of Wallace et al. (Gene VB: 21, 1981), except the overnight incubation on chloramphenicol plates 
was omitted. The papers were incubated at 44* C for 2 h in 0.9 M NaCI. 0.09M Tris-HCI pH 7.5, 6 mM 
EDTA. 0.5% Nonidet P-40, 150 ug/ml E. coli tRNA. The papers were probed with ^P-labeled 2C235 (Table 
1), a 14-mer that is specific for the altered 5' end sequence. Hybridization with 1-2x10^ cpm per filter was 
carried out at 44"C in the prehybridization buffer ovemight. The filters were then washed 3 times in 6 x 

IS SSC, 0.1% SDS at 4'C and 3 times in 2 x SSC, 0.1% SDS at 44-C and exposed to X-ray film. Two 
positive clones were obtained. One of these clones was designated FIX (-G) -* pUC13. 

In order to confirm the sequence of the altered region of the Factor IX portion of the FIX(-G) — pUCl3 
construct, dideoxy sequencing directly on the pUC plasmid using the BRL reverse primer was performed 
using the method of Wallace et al., 1981 (ibid) using a primer end labeled with polynucleotide kinase and 

20 y^P ATP by the method of Ghaconas et al. (ibid). The sequence was as predicted. 

The resulting recombinant plasmid contains three Hae III cleavage sites, the first at position 39 in the 
Factor IX sequence (numbering is based on the published sequence of Anson et al. (ibid), beginning at the 
first ATG), the second at position 130. and a third in the pUCl3 poly linker. The site at 130 is a single base 
pair upstream from the codons for the Lys-Arg processing site of the prepo Factor IX molecule. In the final 

25 Factor IX-Factor Vll hybrid constructs, the Factor IX leader sequence, terminated at position 39 or 130, was 
joined to a synthetic double-stranded fragment comprising the predicted consensus sequence and the last 3 
codons of the Factor IX leader sequence. 

The synthetic consensus fragment was produced by joining oligonucleotides 2C286-ZC289 (Table 1) to' 
form a double-stranded fragment. One hundred pmole of each oligonucleotide was lyophilized and 

30 resuspended in 20 ul of 1x kinase buffer and incubated overnight at 4-0; then heated at 65*0 for 10 
minutes. Two pools were made using the kinased oligonucleotides. Pool 1 contained ZC286 + 2C287; pool 
2 contaned 2C288 + ZC289. The pooled pairs were annealed 10 minutes at 65 "C, then cooled to room 
temperature over a period of 2 hours and placed on ice for 30 minutes. 

The modified Factor IX fragment was removed from FIX(-G) — pUCl3 as a Hind lll-Eco Rl fragment. 

35 Approximately 20 ug of plasmid was digested with 30 units each of Hind III and Eco Rl in 100 ul Hind III 
buffer (BRL) containing 4 ug RNase A at 37* C ovemight. The reaction was terminated by heating at 65'C 
for 10 minutes, and the vector and Factor IX fragments were electrophoresed on a 1% agarose gel and 
purified by electro-elution. The' Factor IX fragment was precipitated with ethanol, resuspended in buffer 
containing 400 ng/ul RNase A, and digested with 9 units of Hae III overnight at 37 '0. The Hind lll-Hae III 39 

40 base pair Factor IX fragment was isolated from this digest by electrophoresis on a 1.5% agarose gel 
followed by electro-elution. To obtain the Hind lll-Hae III 130 base pair Factor IX fragment. FIX-pUCl3 was 
digested with Eco Rl and Hind Mi and the Factor IX fragment isolated as above. Approximately 3 ug of this 
Hind lll-Eco Rl fragment was digested with 6 units of Hae III at 37*0 and aliquots were removed at five ^ 
minute intervals over 30 minutes into a solution containing 50 mm EDTA. The aliquots were pooled and the 

45 Hind lll-Hae III 130 base pair fragment was purified by electrophoresis on a 5% acrylamide gel followed by 
electro-elution. 

The final Factor IX-consensus sequence hybrids were prepared by joining, in a four-part ligation, 
oligonucleotide pools 1 and 2. Factor IX Hind lll-Hae III (39 or 130 base pairs), and pUCl3 Hind lll-Eco Rl. 
The resulting plasmids were used to transform E. coli HB101 (ATCC 33694). Colonies were screened by 

50 digestion of DNA with Eco Rl and Hind III. The sequence comprising the 39 base pair Factor IX sequence 
joined to the synthetic consensus sequence is hereinafter referred to as mini-FIX-FVII. The plasmid 
containing this construct was designated pM7200(-C). The sequence comprising the 130 base pair Factor IX 
sequence joined to the synthetic consensus sequence is referred to as maxi-FIX-FVII. The plasmid 
containing this construct was designated pM7100(-C). The consensus sequence encodes a polypeptide 

55 comprising the amino acid sequence Ala-Asn-Ala-Phe-Leu-Gla-Gla-Arg-Pro-Gly-Ser-Leu-Gla-Arg-Gla-Cys- 
Lys-Gla-Gln-Cys-Ser-Phe-Gla-Gla-Ala-Arg-Gla-IIe-Phe-Gla-Gly-Leu-Asn-Arg-Thr-Lys-Leu. 
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B. Joining Factor IX-consensus sequence hybrid fragnnent to Factor VII cDNA clon . 

The Factor IX-consensus sequence hybrids (either mini or maxi) were joined to the 5' portion of the 
Factor VII cDNA and the vector pUCl3 in a three-part ligation (Figures 4 and 5). The vector fragment was 

5 produced by digesting 6 ug of pUCl3 with 10 units each of Xba I and Hind III in Hind III buffer containing 
RNase A (400 ng/ul). The mini-FIX-FVII fragment was produced by digesting 2 ug of pM7200(-C) with 10 
units each of Hind III and Eco Rl as above. The maxi-FIX- FVII fragment was similarly prepared from 
pM7100(-C). The 5* portion of the Factor VII cDNA was prepared from a plasmid (pUCG705) comprising the 
Eco Rl-Xba I 5' fragment of pUCVII2115 sutx:loned into pUC13 by digestion with Xba I and Eco Rl. Digests 

70 were run at 37 • C for 2 hours and the products were separated by electrophoresis on a 1 .5% agarose gel. 
The desired fragments were electro- eluted, extracted with phenol/CHCb and CHCb. and precipitated with 
ethanol. The three fragments. pUC13/Xba l-Hind III, Factor IX-Factor VII (mini or maxi)/Hind lll-Eco Rl. and 
5' Factor Vll/Eco Rl-Xba I were then ligated in 20 ul of ligase buffer containing 2 ul 20 mM ATR and 0.9 unit 
T* DNA ligase overnight at 4 • C. Colonies were screened by restriction analysis with Hind III and Xba I. The 

75 recombinant plasmids containing the mini- and maxl-FIX-FVII sequences were designated pM7200 and 
pM7100, respectively (Rgure 4). 

Due to the linker addition used in producing the Factor VII cDNA, modifications had to be made in the 
fusion sequences to generate correct in frame coding sequences. Both mini- and maxi-fusions contain an 
Eco Rl site at the junction between the Factor IX-consensus sequence hybrid and the Factor VII cDNA 

20 which is an artifact of the cDNA cloning process. In addition, the mini-fusion requires the addition of a C to 
change the sequence at the Hae III site from ^ AGGCCA^* to ^ AGGCCCA^ and establish the correct reading 
frame downstream of this sequence. These corrections were made by oligonucleotide-directed site specific 
mutagenesis, essentially as described for the two-primer method by Zoller and Smith (Manual for Advanced 
Techniques in h/lolecular Cloning Course . Cold Spring Harbor Laboratory, 1983). The mini-FIX-FVII frag- 

25 ment was removed from pM7200 by digestion witti Hind III and Xba I and inserted into M13mp19, The 
maxi-FIX-FVII fragment was purified from pM7100 and subcloned in a similar manner. The mutagenic 
primers ZC^3 and ZC336 (see Table I) were used for removal of the Eco Rl site and the base insertion, 
respectively. In each case, the universal primer ZC87 was used as the second primer. The mutagenic 
primers were phosphorylated by combining 40 pmoles of primer and 60 pmoles ATP with 1 unit of T* DNA 

30 kinase overnight at 60 'C. To remove the Eco Rl site from the maxi-FIX-FVII hybrid, i ug of the Ml 3 single- 
stranded template was combined with 20 pmoles each ZC333 and ZC87 in a total volume of 10 ul. The 
primers were annealed to the template for 10 minutes at 65 -C. cooled to room temperature for 5. minutes, 
then placed on ice for 5 minutes. The primers were extended using DNA polymerase I (Klenow fragment). 
To remove the Eco Rl site and correct the reading frame in the mini-FIX-FVII hybrid, 1 ug of the appropriate 

35 Ml 3 single-stranded template was combined with 20 pmoles each ZC333, ZC336 and ZC87. Annealing and 
primer extension reactions were carried out as described atjove. Plaque lifts were screened with ^P-Iabeled 
primer (ZC333 or ZC336) at 60 *C and sequences confirmed by dtdeoxy sequencing. The resultant 
constructs, comprising the maxi- and mini-FIX-FVII sequences, were designated pM7111 and pM721l. 
respectively. 

40 The consensus sequence contains several regions which do not conform to the protein sequence data 
obtained for Factor VII (Rgure 2). In order to produce a sequence which encodes a polypeptide with greater 
homology to the ami no-terminal portion of Factor VII. the consensus sequence was altered by 
oligonucleotide-directed site-specific mutagenesis. The changes made were the insertion of Leu at position 
8. substitution of lie for Lys at position 18 (numbers refer to the amino acid position after the inseertion at 

45 position 8). Asn for Ala at position 26, and the sequence Ala-Ser-Asp for Gly-Leu-Asn at positions 32-34 
(based on tentative amino acid sequence data). 

The sequence changes at positions 8 and 18 were made using pM7111 (sense strand) as template. 
Primers ZC352 CCC AGG TCT CAG CTC CTC CAG^') and ZC353 f CTG CTC CTC CTT ACA CTC 
TCT^') were annealed to the template and extended as described above. The resultant phage clone was 

50 designated pM7114. The sequence of the insert in pM7114 was confirmed by dideoxy sequencing. 

In a similar manner, the changes at positions 26-34 were made on the pM7114 template (sense strand) 
using the mutagenic primer ZC366 (**CAG CTT CGT CCT GTT CAG GCC CTC GAA GAT CTC GCG GGC 
CTC CTC GAA^') and ZC87 (Table 1) as second primer. The resultant construct was designated pM7115. 
The sequence of the entire 550 bp insert In the Ml 3 vector was determined by dideoxy sequencing and 

55 found to b correct. 
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Example 5: Constaiction of Factor IX-Factor VII cDNA fusion . 

The Factor IX-Factor VII cDNA fusion was prepared using Factor IX cDNA obtained from a human liver 
cDNA library as described by Karachi and Davie (ibid) and the Factor VII cONA sequence described in 
5 Example 1. 

The fusion point chosen for the hybrid protein was between amino acid +38 (threonine) of Factor IX 
and the first lysine encoded by the Factor VII cDNA sequence. Such a protein would be encoded by a 
sequence consisting of the first 252 bp of the Factor IX cDNA sequence and all of the pUCVII2115 Factor 
VII cDNA sequence except the first two codons. To construct this hybrid sequence, the Factor IX sequence 

10 was first fused to pUCVII2115 using convenient restriction sites. This fusion resulted in the plasmid 
FIXA/ll/12 (described below) which contains the first 310 bp of the Factor IX cDNA joined to the entire 
Factor VII cDNA sequence. To achieve the precise junction desired for the hydrid protein, the intervening 
base pairs were removed by oligonucleotide-directed mutagenesis. , . 

Joining of the Factor IX cDNA sequence to the Factor VII cDNA sequence was accomplished by ligating 

15 a 0.3 kb Hind III-Aha III fragment of FIX (-G) pUCl3 (Example 4) to a 4.7 kb Sma l-Hind 111 fragment from 
pUCVII2115 (Rgure 5). The Hind lll-Aha III fragment was prepared by digesting 3 ug of FIX(-G) pUC13 
with 40 units of Hind 111 in 40 ul of medium salt buffer (Maniatis et al., ibid) at 37 'C. 4 hours. The volume 
was then increased to 100 ul of medium salt buffer, and 5 units of Aha 111 were added and the 37 'C 
incubation continued for 18 hours. The DNA fragments were separated by electrophoresis in 1% agarose 

20 and the 0.3 kb band Isolated as described above. A Sma I parital digestion of pUCVII2115 was obtained by 
incubating 3 ug of pUCVII2115 at 25 'C for 1 hour with 4.8 units of Sma 1 in a reaction volume of 30 ul. The 
reaction was stopped by a 15-minute incubation at 65 "C. The sample was then extracted once with an 
equal volume of phenol and ethanol precipitated. 

The precipitate was collected by a 10-minute microfuge spin, rinsed with 70% ethanol and air dried. 

25 The DNA was redissolved in 30 ul of medium salt buffer and digested with 30 units of Hind 111 at 37*C for 3 
hours. The DNA was subjected to electrophoresis in 0.7% agarose and the 4.7 kb Hind lll-Sma I fragment 
isolated as described above. Equimolar amounts of the two fragments (0.048 pmoles) were ligated in a 10 
ul reaction containing 50 mM Tris-HCI pH 7.5. 10 mM MgCb. 1 mM DTT. 1 mM ATP. and 3 units of T* 
DNA ligase at 14* C for 3.5 hours and then used to transform competent E. coll RRI (ATCC 31343), The 

30 cells were grown on ampicillin plates and 12 of the resulting colonies were "screened by restriction enzyme 
digestion for the presence of the desired plasmid construction. DNA from colony 12 (FIXA/ll/12) gave the 
expected restriction enzyme digestion pattern and was used in the next step of the hybrid gene 
construction. 

The oligonucleotide-directed mutagenesis procedure was performed on a single-stranded DNA tem- 

35 plate. Thus, it was necessary to clone the fused Factor IX/Factor VII sequences into Ml3mpl9. To obtain a 
conveniently small DNA fragment, a 640 bp Hind Ill-Xba I fragment was isolated from FlXA/ll/12. This 
fragment contains 310 bp of the 5* end of Factor IX cDNA and 330 bp of the Factor VII sequence. The 
vector was prepared by digesting 1 ug of M13mp19 RF DNA with 20 units of Hind 111 and 20 units of Xba I 
in 40 ul of medium salt buffer at 37'C for 18 hours. The DNA was subjected to electrophoresis in 1.2% 

40 agarose and the linear 6.4 kb fragment isolated from the gel as described above. Hve ug of FIXA/ll/12 DNA 
was digested with 10 units of Xba I in 40 ul of medium salt buffer at 37* for 18 hours. Twenty units of Hind 
III were added and the digestion continued at 37 • C for an additional 7 hours. The resulting fragments were 
separated by electrophoresis in 1.2% agarose and the 640 bp fragment eluted as above. Ten ng of 
linearized m13mp19 and 1 ng of the 640 bp fragment were ligated at 14* C for 1 hour and then used to 

45 transform competent E. coli JM101 (Messing, Meth. in Enzymology , ibid). The cells were plated with X-gal 
and IPTG (Messing, Meth. in Enzymology , ibid) and eight light blue plaques were picked and used to infect 
2.5 ml cultures of E. coli JM103 at Asoo = 0.3. After 18 hours' growth at 37 •C. the cells were harvested by 
centrifugation in a room temperature clinical centrifuge and 20 ul of the supernatant which contains the M13 
phage was mixed with 10 ug/l ethidium bromide. By comparison with known standards, each of the eight 

50 clones had an insert of approximatiey the correct size. Single-stranded DNA was then prepared from 1.5 ml 
of the supernatants as described by Messing ( Meth. in Enzymology . ibid). This construct was then 
sequenced by the dideoxy method using the oligonucleotide 2C87 as a primer to confirm that the insert 
junction was correct. One of the correct clones (#4) was used as a template in oligonucleotide-directed 
mutagenesis to produce a functional Factor IX-Factor Vll fusion. 

55 The oligonucleotide ZC249, a 20-mer consisting of 10 bp of the desired Factor IX sequence and 10 bp 
of the desired Factor Vll sequence (Table I) was used as the mutagenic primer. The oligonucleotide ZC87, 
which hybridizes to the M13mp19 sequence, was used as the second primer. 
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The mutag n sis procedur was modified from that of Zoller and Smith (ibid). For the annealing 
reaction, 20 pmoles of ZC249 were phosphorylated by incubating overnight at 4* C in 20 ul 60 mM Tris-HCI 
pH 8.0. 10 mM MgCb, 1 mM DTT. ImM ATP, 1 unit T* kinase. The reaction was stopped by incubation at 
65 "C for 15 minutes, and the sample was lyophlized. One pmole of single-stranded clone #4 template and 

5 20 pmole of 2C87 were added in 10 ul annealing buffer (200 mM Tris-HCI pH 7.5. 100 mM MgCb. 500 mM 
NaCI, 10 mM DTT). The sample was heated to 65* C for 10 minutes, incubated at room temperature for 5 
minutes, and then placed on ice. Ten ul of the following solution was prepared fresh and added to the 
sample: 20 mM Tris-HCI pH 7.5. 10 mM MgCb, 10 mM DTT, 1 mM dNTPs. 1 mM ATP. 0.15 units/ul T* 
DNA ligase. 0.25 units/ul E. coli DNA Polymerase ! (Klenow fragment). The reaction was then incubated at 

70 15 'C for 3 hours and Sie~sample used to transform competent E. coli JM101 (Messing, Meth. in- 
Enzymology , ibid). 

The resulting plaques were lifted onto nitrocellulose and screened by hybridization to ^P-labeled 
ZC249. Dry BASS filters (Schleicher & Schuell, 0.45 um) were laid onto the agar plate and-the phage 
allowed to adsorb for 5 minutes. The filters were removed and allowed to dry for 5 minutes, placed on 

75 Whatman 3 MM paper, saturated in 0,5 M NaOH. 1.5 M NaCI for 5 minutes, air dried for 3 minutes, placed 
on Whatman paper, saturated in 1 M Tris-HCI pH 8. 1 .5 M NaCI. for 5 minutes, and air dried for 3 minutes. 
The Tris-HCI step was repeated and the filters were rinsed in 100 ml 6 x SSC for 2 minutes at room 
temperature. After air drying, the filters were baked at 80* C for 2 hours and prehybridized at 47'C (T|„-4' 
of ZC249) ovemight in 6.7 x SSC pH 6.5, 2 mg/ml E. coli tRNA. and 0.2% (w/v) each BSA, Ficoll, and 

20 polyvinylpryrolidine. 

After the prehybridization step, the fitters were incubated with 2.5 x 10^ cpm/filter of labeled ZC249 in 
the same SSC hybridization buffer at 47 'C overnight Following hybridization, the filters were washed 3 
times, 5-10 minutes each, at room temperature in 6 x SSC and exposed to X-ray film. Putative positive 
plaques were replated and screened as above. Individual plaques were then picked, and single-stranded 

25 DNA was prepared and sequenced using ZC275 as a primer. The oligonucleotide ZC275 corresponds to a 
sequence 40 bp in the 5* direction of ZC249 on the same strand (Table 1). 

Four positive plaques were identified. The entire insert in M13mp19 for one clone (FlX/VII-9) was 
sequenced by the dideoxy method using the oligonucleotides ZC87 and ZC275 and determined to be 
correct. The confirmed sequence is represented by bases 1-567 in Rgure 7, RF DNA from this clone was 

30 then used for the final step in the construction of the hybrid gene. 

Three fragments were used to make the final construction: the 0.6 kb Hind Ill-Xba I fragment from 
FIXA/ll-9 containing the fused IXA/II sequences; a 1.7 kb Xba l-Bam HI Factor VII cDNA fragment from 
pUCVII1923; and a 2.7 kb Bam Hl-Hind III fragment of pUC13. Three ug of FDWII-9 (RF DNA) were 
digested at 37 • C for 6 hours with 45 units of Xba I in a volume of 50 ul. The DNA was precipitated with 

35 ethanol. resuspended and digested at 37*C for 4 h with 50 units of Hind 111. The sample was subjected to 
electrophoresis in 1 % agarose and the 0.6 kb band electro-eluted from the paper with 1 .5 M NaCI. 50 mM 
Tris-HCI pH 8. 1 mM EDTA, phenol extracted and precipitated witii ethanol. 

To obtain the remaining Factor VII cDNA sequence. 5 ug of pUCVII1923 was digested at 37'C for 3 
hours with 36 units of Xba I in 40 ul of medium salt buffer. Then 8 ul of lOx high salt buffer. 28 ul of H2O. 

40 and 4 ul (40 units) of Bam HI were added and the reaction incubated at 37 -C for 3 hours. The DNA 
fragments were separated by electrophoresis in 1 % agarose and the 1 .7 kb fragment isolated as described 
above. 

The vector fragment was prepared by digesting 1 ug of pUCl3 with 10 units of Hind lit in 20 ul of 
medium salt buffer at 37'C for 1 hour. Two ul of 10x high salt buffer and 10 units of Bam HI were then 
45 added and the incubation continued for another 2 hours. The DNA was purified on a 1% agarose gel as 
described above. 

Equimolar amounts (approximately 0.56 pmoles) of the three fragments were ligated at room tempera- 
ture for 45 minutes in 10 ul of 50 mM Tris-HCI pH 7.5. 10 mM MgCb. 1 mM DTT, ImM ATP and 3 units T4. 
DNA ligase. The reaction mixture was used to transform competent E. coN JM83. The cells were plated on 
so medium containing 40 ug/ml ampicillin with 50 ul of 2% X-gal added to each plate. DNA was prepared from 
7 white colonies and then screened by restriction enzyme digestion. One of the clones giving the correct 
pattern was designated FIXA^II — pUCl3. 

Example 6: Expression of Biologically Active Factor VII Analogs . 

55 

The mammalian cell expression vector pD2 was chosen for expression of th FIXA/II gene in 
transfect d animal cells. It was constructed from plasmid pDHFR-lll (Berkner and Sharp. Nuc. Acids Res. 
13: 841-857. 1985) in th following manner. The Pst I sit abutting the DHFR cDNA in pDHFR III was 
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converted to a Bam HI site by conventional linkering (Scheller. R.H.. Dickerson, R.E.. Boyer, H.W.. Riggs, 
A.D.. and Itakura. K., Science 196 : 177-180. 1977). The pDHFR 111 DNA was Incubated with 10 mM Tris pH 
7,6. 6 mM i3-MSH, 6 mM NaCI. 10 mM MgCb and 2.5 units Pst 1 for 10 minutes at 37* C, followed by 
phenol extraction and ethanol precipitation. The Pst I cohesive termini were blunt ended using T* DNA 
5 polymerase. After phenol extraction and dialysis against 10 mM Tris pH 8.0. 1 mM EDTA. 0.3 M NaCI, the 
DNA was ethanol precipitated. The DNA was resuspended in 20 ul 1.4 mM ATP, 50 mM Tris pH 7.6. 10 
mM MgCi2. 1 mM dithiothreitol and then incubated with 5 ng of T* polynucleotide kinase-treated Bam HI 
linkers (New England Biolabs) and 200 units of T* polynucleotide ligase for 12 hours at 12 'C. followed by 
phenol extraction and ethanol precipitation. The DNA was digested with 90 units of Bam HI at 37 • C for 1 

10 hour, followed by electrophoresis through a 1.4% agarose gel. The 4.9 kb DNA fragment (corresponding to 
pDHFR III DNA lacking the DHFR cDNA and SV40 polyadenylation signal) was electro-eluted and 
redrcularized with polynucleotide ligase and then transfected into E. coli HB101. Ampicillln-sensitive 
colonies were screened by rapid prep analysis (Birnboim, H.C., and DolyTJ.. Nucleic Acids Research 7: 
1513-1523, 1979) and the correct clone was grown up to generate a large-scale plasmid DNA preparation. ~ 

15 The resultant plasmid was cleaved with 20 units Bam HI and treated with 2.5 ug calf intestinal 
phosphatase and electrophoresed on a 1.4% agarose gel. Twenty-five ug of pSV40 (a clone of SV40 DNA 
inserted into the Bam HI site of pBR322) were digested with 25 units of Bel I for 1 hour at 50 • C. followed 
by the addition of 25 units of Bam HI, and the incubation continued for 1 hour at 37 'C. This DNA was then 
electrophoresed on a 1.4% agarose gel. The Bam Hl-cut vector (i.e.. that lacking the polyadenylation 

20 signal) was joined to the SV40 DNA fragment (.14 to .19 map units [Tooze. J., ed.. "DNA Tumor Viruses. 
Molecular Biology of Tumor Viruses"]) containing the late polyadenylation signal by Incubating the gel- 
purified fragments (0.1 ug each) in 20 ul 50 mM Tris pH 7.6. 10 mM MgCb, 1 mM dithiothreitol, 1.4 mM 
ATP and 100 units T4 polynucleotide ligase for 4 hours at 12 'C. followed by transformation into E. coli 
RR1, Positive colonies were identified by rapid prep analysis, and a large-scale plasmid preparation ~of the 

25 correct DNA. pD2. was prepared. 

To make the Factor IXA^II expression construction, 1 ug of pD2 was digested at 37 ^ C for 1 hour with 20 
units of Bam HI in 20 ul of high salt buffer. Twenty ul of 10 mM Tris-HCI pH 8, 1 mM EDTA and 0.1 unit of 
calf alkaline phosphatase (Boehringer) were then added. The reaction was incubated at 37 • C for 1 hour and 
stopped by heating to 75*C for 10 minutes. Ten ug of FIXA^II — pUCl3 was digested at 37 for 2 hours 

30 with 150 units of Bam HI in 150 ul of high salt buffer. The DNA fragments were separated by 
electrophoresis in 1 .2% agarose and the 2.3 kb fragment was isolated. Equimolar amounts (0.01 5 pmoles) 
of the 2.3 kb Bam HI fragment and tiie pD2 vector fragment were ligated at 14- C for 2.5 hours as above. 
The reaction mixture was used to transform E. coli RR1 cells, which were then plated on medium containing 
10 ug/ml ampicillin. Plasmid DNA was prepared from 12 of the resulting colonies and screened by 

35 restriction enzyme digestion. One of the clones with the correct enzyme digestion pattern was designated 
FIX/VlI/pD2 (Rgure 6). E. coli RR1 transformed with FIXA/H/pD2 has been deposited with ATCC under 
accession number 53068. 

The procedure used to transfect baby hamster kidney (BHK) cells (available from American Type 
Culture Collection, accession number CCLIO) with FIXA/II/pD2 was similar to published methods (for 

40 example. Wigler et al.. Cell 14: 725, 1978; Corsaro and Pearson. Somatic Cell Genetics7: 603. 1981; 
Graham and Van der Eb. Virology 52 : 456. 1973). The BHK cells were grown at 37 -C. 5% CO2, in 
Dulbecco's media (plus 10% heat-inactivated fetal calf serum and supplemented with glutamine and 
penicillin-strep-tomycin) in 60 mm tissue culture Petri dishes to a confluency of 20%. A total of 10 ug DNA 
was used to transfect one 60 mm dish: 3.75 ug of FIXA/lI/pD2. 1 .25 ug of pKOneo (Soutiiern and Berg. J. 

45 Mol. Appl. Genet 1^: 327-341. 1982) and 5 ug of salmon sperm DNA. The DNAs were precipitated in 0.3 M 
NaOAc. 75% ethanol. rinsed with 70% ethanol and redissolved in 20 ul 10 mM Tris-HCI pH 8, 1 mM EDTA. 
The DNA was combined with 440 ul H2O and 500 ul of 280 mM NaCI, 1.5 mM NaHPO*, 12 mM dextrose. 
50 mM HEPES pH 7.12. Sixty ul of 2 M CaCb were added dropwise to the above mixture and the solution 
let stand at room temperature for 30 minutes. The solution was then added to the cells and the cells 

50 returned to 37 • C for 4 hours. The medium was removed and 5 ml of 20% DMSO in Dultjecco's witii serum 
were added for 2 minutes at room temperatijre. The dish was then washed rapidly with 2 changes of 
medium and incubated in fresh medium overnight. Twenty-four hours after the DNA was added, the 
medium was removed and selective medium added (10 mg/ml of G418. 498 ug/mg. Gibco, in Dulbecco's 
with serum). After 10 and 13 days, individual clones, representing cells tiiat had incorporated the pKO-neo 

55 gene and were thus resistant to G418, were transferred to 96-well (or 24-well) plates and grown up for 
protein assays. 

Cells were grown in Dulbecco's plus 10% fetal calf serum containing 5 ug/ml vitamin K (Phytonadione, 
Merck). The medium was separated from the cells and cellular debris by centrifugation, and assayed for 
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Factor VII polyp ptid (by ELISA) and for biological activity. The cells were removed from the plates with 
trypsin, washed with fresh m dium, centrifuged. and froz n at -20 • C. Th c II p II ts were then thawed in 
PBS, pelleted, and resuspended in PBS containing 0.25% Triton X-100. Samples were diluted and assayed 
for polypeptide and activity. 

5 The ELISA for Factor VII was done as follows. Two hundred microliters of a monoclonal antibody 
against human Factor VII (5 ul/ml in 0.1 M NajCOa pH 9.6) were incubated in each well of a 96-well 
microtiter plate 2 hours at 37 'C. The wells were then incubated with 220 ul of 1% bovine serum albumin 
(BSA) and 0.05% Tween 20 in PBS pH 7,2 2 hours at 37 •C. The plates were rinsed with H2O. air dried, 
and stored at 4 * C. To assay samples, 200 ul samples were incubated 1 hour at room temperature in the 

70 antibody-coated wells. The wells were then rinsed four times with 200 ul PBS containing 0.05% Tween 20. 
The wells were then incubated for 1 hour at room temperature with 200 ul of an IgG fraction of rabbit 
polyclonal antiserum against Factor VII (5 ug/ml in PBS containing 1% BSA and 0.05% Tween 20). This 
was followed by incubation with goat anti-rabbit IgG coupled to alkaline phosphatase. The welts we/e then 
rinsed four times, with PBS containing 0.05% Tween 20. To the wells were added 200 ul p-nitrophenyl 

J5 phosphate (30 mg) dissolved in diethanolamine buffer (96 ml per liter) pH 9.8 containing 56 mg/l MgCfe. 
The enzyme reaction was done at 37 • C and the development of a yellow color was monitored at 405 nm 
using an ELISA plate reader. Results obtained for cell media are given in Table 3. 

Factor VII biological activity was assayed by the one-stage clotting assay described by Quick 
(Hemorragic Disease and Thrombisis , 2nd ed.. Leat Febiger, Philadelphia, 1966). Results obtained for cell 

20 media are given in Table 3. 

Cells/ml Factor VII Factor VII 

Day (xlO"^) polypeptide ng/ml activity (ng/ml) 



25 



30 



35 



1 2.9 

2.1 25 6.0 

2 1 .9 

2.8 47 15.9 

3 1.96 

2.26 160 93 

4 4 .71 

4.14 550 300 



4S 5 8.79 

11.28 725 531 



50 



5.1 

8.4 975 600 



55 Example 7: Expression of Factor IX 

Fourteen ug of FIX(-G) pUC13 were digest d with 30 units of Bam HI in 30 ul of high salt buffer for 3 
hours at 37 'C. The DNA was then subjected to electrophoresis in 1% agaros and th 1.4 kb band 
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10 



15 



contining the Factor IX sequence was isolated from ttie gel. 

Three ug of the vector pD2 were digested with 30 units of Bam HI In 30 ul high salt buffer for 3 hours at 
37 -C. The DNA was subjected to electrophoresis in 1% agarose and the linear 1.5 kb fragment isolated. 
The DNA was then treated with 0.12 units calf alkaline phosphatase in 30 ul of 10 mM Tris-iHCI pH 8, 1 mM 
EDTA for 30 minutes at 37 • C. The salt was adjusted to 0.3 M NaOAc and the sample extracted twice with 
phenol, once with chloroform and the DNA was ethanol precipitated. The pellet was rinsed in 70% ethanol, 
dried and redissolved in 20 ul 10 mM Tris-HCI pH 8» 1 mM EDTA. Equimolar amounts (0.02 pmoles) of the 
two fragments were ligated with 10 units of T* DNA ligase as described above. The reaction mixture was 
used to transform E ^li RR1 cells. DNA from twelve of the resulting ampicillin-resistant colonies was 
screened by restriction enzyme digestion. One of the clones with the 1 .4 kb fragment inserted in the correct 
orientation was designated as FIX(-G)/pD2. E. coli RR1 transformed with FIX(-G)/pD2 has been deposited 
with ATCC under accession number 53067. 

BHK cells were co-transfected with FIX(-G)/pD2 and pKO-neo as described above. Drug-resistant cells 
were selected and prepared for ELISA and activity assay as described in Example 6. - f * 

The assay for biologicaJ activity is based on the ability of Factor IX to reduce the clotting time of plasma 
from Factor IX-deficlent patients to normal. It was done as described by Procter and Rapaport (Amer. J. 
Clin. Path. 36: 212, 1961). Results are shown in Table 4. 
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TABLE 4 



25 



Factor IX 
Cells/ml polypeptide (ng/ml) 
Day (xlO-^ ) supernatant pellet 



Factor IX 
activity ( ng/ml ) 
in supernatant 



% active 
protein in 
supernatant 



1 1.65 



2 2.66 



57 
45 



20 
20 



27 
24 



50% 



35 



3 9 .69 



150 
120 



60 
60 



72 
84 



58% 



14 .79 



475 
225 



160 
140 



198 
150 



50% 



45 



50.85 



875 
1000 



250 
260 



408 
438 



45% 



The amount of Factor IX polypeptide was determined by ELISA essentially as described in Example 6 
50 using polyclonal rabbit antisera to Factor IX Following the incubation of the wells with the Factor IX- 
containing samples, the wells were rinsed and incubated 1 hour at room temperature with 200 ul of affinity 
purified rabbit polyclonal anti-Factor IX conjugated to alkaline phosphatase diluted 1:1000 in PBS containing 
1% BSA and 0.05% Tween 20. The wells were then rinsed four times with PBS containing 0.05% Tween 
20, and enzyme substrate was added as above. Incubations were run at 4 • C ovemight or 37 • C for 2 hours. 
55 As shown in Table 4, 70-80% of the Factor IX polypeptide is secreted into the media, and about 50% of 
this is biologically active. No Factor IX activity was detected in the cell pellets. 

Highest levels of activity were achieved by supplementing the cell cultur medium with vitamin K 
(phytonadion . Merck) at concentrations of 1-10 mg/ml. 
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Several additional analys s were performed to demonstate that the cells were secreting authentic Factor 
IX. Samples containing Factor IX activity according to the above assay were incubated with Factor VIII- 
deficient plasma but did not affect the clotting time, indicating that the activity was due to authentic Factor 
IX rather than a non-specific clotting agent. This conclusion was further verified by depletion of Factor IX 
5 activity from the samples with a specific antibody. Ninety-seven to ninety-eight percent of the Factor IX 
activity was immuno-precipitated from cell supematants with a rabbit polyclonal antibody against Factor DC 
This antibody also precipitated over 99% of the Factor IX activity from normal plasma. No Factor IX activity 
was removed from the supematants by rabbit polyclonal antibody to erythropoietin. 

TO Example 8: Construction of an expression vector for Factor VII . 

An expression vector comprising the synthetic Factor VII 5* coding region joined to the partial Factor VII 
cDNA was constructed. The vector, designated pM7135. was generated by inserting the Factor IX leader - 
5* Factor VII sequence from pM7115 and the 3* Factor VII sequence from FIX/VII/pD2 into plasmid pD3, 

15 which comprises the SV40 enhancer and the adenovirus 2 major late promoter and tripartite leader. 

Plasmid pD3 was generated from plasmid pDHFRIII. The Pst I site immediately upstream from the 
DHFR sequence in pDHFRIII was converted to a Bel I site by digesting 10 ug of plasmid with 5 units of Pst 
I for 10' at 37*C in 100 ul buffer A (10 mM Tris pH 8. 10 mM MgCh. 6 mM NaCI, 7mM j8-MSH). The DNA 
was phenol extracted. EtOH precipitated, and resuspended in 40 ul buffer B (50 mM Tris pH 8. 7 mM 

20 MgCb. 7 mM ^-MSH) containing 10 mM dCTP and 16 units T4 DNA polymerase and incubated at 12*C for 
60 minutes. Following EtOH precipitation, the DNA was ligated to 2.5 ug kinased Bel I linkers in 14 ul buffer 
C (10 mM Tris pH 8, 10 mM MgCfe. 1 mM DTT, 1.4 mM ATP) containing 400 units T4 polynucleotide ligase 
for 12 hours at 12'C. Following phenol extraction and EtOH precipitation, the DNA was resuspended in 120 
ul buffer D (75 mM KCI. 6 mM Tris pH 7.5. 10 mM MgCb, 1 mM DTT). digested with 80 units Bel I for 60 

25 minutes at 50 'C. then electi'ophoresed through agarose. Fonnn III plasmid DNA {10 ug) was isolated from 
the gel, and ligated in 10 ul buffer C containing 50 units T4 polynucleotide ligase for 2 hours at 12'C. and 
used to transform E. coli HB101. Positive colonies were identified by rapid DNA preparation analysis, and 
plasmid DNA (designated pDHFR') prepared from positive colonies was transformed into dAM" E. coli. 

Plasmid pD2' was then generated by cleaving pDHFR' (15 ug) and pSV40 (25 ug) in I00~"urbuffer D 

30 with 25 units Bcl I for 60 minutes at 50 'C, followed by the addition of 50 units Bam HI and additional 
incubation at 37*C for 60 minutes. DNA fragments were resolved by agarose gel electrophoresis, and the 
4.9 kb pDHFR* fragment and 0.2 kb SV40 fragment were isolated. These fragments (200 ng pDHFR' DNA 
and 100 ng SV40 DNA) were incubated in 10 ul buffer C containing 100 units T4 polynucleotide ligase for 4 
hours at 12"C. and the resulting construct (pD2*) used to transform E. coli RRI. 

35 Plasmid pD2* was modified by deleting the "poison" sequences"ln the pBR 322 region (Lusky and 
Botchan. Nature 293 : 79-81, 1981). Plasmids pD2* (6.6 ug) and pML-1 (Lusky and Botchan. ibid) (4 ug) 
were incubated in 50 u I buffer A with 10 units each Eco Rl and Nru 1 for 2 hours at 37 'C. followed by 
agarose gel electrophoresis. The 1.7 kb pD2' fragment and 1.8 kb pML-1 fragment were isolated and ligated 
together (50 ng each) in 20 ul buffer C containing 100 units T4 polynucleotide ligase for 2 hours at 12 'C, 

4o followed by transformation into E. coli HB101. Colonies containing the desired construct (designated A pD2) 
were identified by rapid preparatidFranalysis. Ten ug of A pD2 were then digested with 20 units each Eco 
Rl and Bgl II. in 50 ul buffer A for 2 hours at 37 • C. The DNA was electrophoresed through agarose, and the 
desired 2.8 kb fragment (fragment C) comprising the pBR322. 3* splice site and poly A sequences was 
isolated. 

45 To generate the remaining fragments used in constructing pD3. pDHFRIII was modified to convert the 
Sac II (Sst II) site into either a Hind III or Kpn I site. Ten ug pDHFRIII were digested with 20 units Sst II for 2 
hours at 37 • 0. followed by phenol extraction and ethanol precipitation. Resuspended DNA was incut>ated in 
100 ul buffer B containing 10 mM dCTP and 16 units T4 DNA polymerase for 60 minutes at 12 'C. phenol 
extracted, dialyzed, and ethanol precipitated. DNA (5 ug) was ligated with 50 ng kinased Hind III or Kpn I 

50 linkers in 20 ul buffer C containing 400 units T4 DNA ligase for 10 hours at 12 "C. phenol extracted, and 
ethanol precipitated- After resuspension in 50 ul buffer A. the resultant plasmids were digested with 50 units 
Hind 111 or Kpn I. as appropriate, and electrophoresed through agarose. Gel-isolated DNA (250 ng) was 
ligated in 30 ul buffer C containing 400 units T4 DNA ligase for 4 hours at 12*C and used to transform E. 
coli RRI. The resultant plasmids were designated pDHFRIII (Hind III) and pDHFRIII (Kpn I). A 700 bp Kpnl- 

55 Bgl II fragment (fragment A) was then purified from pDHFRIII (Kpn I) by digestion with Bgl II and Kpn I 
followed by agarose gel electrophor sis. 

The SV40 enhancer sequenc was ins rted into pDHFRIII (Hind III) as follows: 50 ug SV40 DNA was 
incubated in 120 ul buffer A witii 50 units Hind III for 2 hours at 37 'C. and the Hind III C SV40 fragment 

23 



EP 0 200 421 B1 



(5089-968 bp) was gel purified. Plasmid pDHFRIII (Hind III) (10 ug) was treated with 250 ng caif intestinal 
phosphatase for 1 hour at 37 * C, phenol extracted and ethanol precipitated. The linearized plasmid (50 ng) 
was ligated with 250 ng Hind III C SV40 in 16 ul buffer C for 3 hours at 12 'C, using 200 units T4 
polynucleotide ligase, and transformed into E. coli HB101. A 700 base pair Eco Rl-Kpn I fragment (fragment 

5 B) was then Isolated from this plasmid. 

For the final construction of pD3, fragments A and B (50 ng each) were ligated with 10 ng fragment C 
with 200 units T4 polynucleotide ligase for 4 hours at 1 2 ' C, followed by transfection of E. coli RRI. Positive 
colonies were detected by rapid preparation analysis, and a large-scale preparation of pD3 w^ made. 

Expression vector pM7135 was then constructed. The replicative form of pM7115 was digested with 

70 Bam HI and Xba I and the 550 base pair fragment comprising the Factor IX leader and 5* Factor VII 
sequence was gel purified. Plasmid FlX/VlI/pD2 was digested with Xba I and Bam HI and the 1700 bp 
fragment comprising the 3' portion of the Factor VII cDNA was gel purified. Plasmid pD3 was digested with 
Bel K treated with calf alkaline phosphatase, and the three fragments joined in a triple ligation. The resciltant 
constructs were screened for the presence of a 2000 base pair Xba I fragment. A plasmid having the 

15 correct orientation was selected and designated pM7135 (Figure 8). 

Example 9: Expression of Factor VII From cDNA Clones. 

In order to express Factor VII cDNA containing a Factor VII leader, DNA from XVII2463 or XVII565 and 

20 XVII2463 was cloned into an expression vector containing the Ad2 major late promoter. SV40 enhancer 
sequences, the Ad2 tripartite leader, a splice set, and the SV40 polyadenylation signal. This vector was 
adapted so that it contains a unique EcoRI sequence as the site of cDNA insertion. The expression of 
sequences from XVII2463, which encodes a 60 amino acid leader, and from XV1I565 and XVI)2463. which 
lacks the codons for amino acids from -1 8 to -39 and thus encodes a leader 38 amino acids in length, were 

25 evaluated. Because the structure of the Factor VII leader has only been identified by cDNA cloning, and 
because of the ambiguity generated by having obtained two different 5' -terminal cDNAs. the inventors also 
constructed a genomic-cDNA Factor VII sequence. The 3' portion of XVII2463 (from the Bgl II site in exon 2; 
to the EcoRI site linkered 3' to the poly(A) tail) was adjoined to a subgenomic fragment of clone 7m1, that 
encoding exons 1 a, 1 b and the remainder of exon 2. This subgenomic fragment, reconstructed as an EcoRI- 

30 Bglll 4.4 Kb fragment, was adjoined to the XVI 12463 cDNA and cloned into a mammalian expression vector. 

Briefly, in order to construct the subclones, tiie Factor VII cDNA EcoRI fragment of XVI 12463 was cloned 
into the EcoRI site of pLIC18, and designated pVII2463. Similarly, the EcoRI cDNA insert of XVII565 was 
subcloned into pUCl8. and designated pVII565. A hybrid between the 5* portion of the Factor VII sequence 
of clone pVII565 and the 3* segment of Factor VII DNA of pVll2463 was constructed by cloning the 5*-most 

3S EcoRI-Bgl II Factor VII fragment of pVll565 and the Bgl ll-Hind III Factor VII fragment (Hind III site in 
polylinker of pVII2463) of pVII2463 into pUC 18 digested with EcoRI and Hind III. This construct was 
designated pVII2397. The inserts of p VI 12463 and p VI 12397 were removed by EcoRI digestion and gel 
purified for insertion into mammalian expression vectors as described below. 

40 A. Expression of full-length Factor VII cDNA. 

The expression of Factor VII was achieved In the vector pDX This vector was derived from pD3 
(described in Example 8 above) and pD3', a vector identical to p03 except that the SV40 polyadenylation 
signal (I.e. the SV40 Bam HI [2533 bp] to Bell [2770bp] fragment) Is in the late orientation. Thus, pD3' 

45 contains a Bam HI site as the site of gene insertion. 

To generate pDX. the EcoRI site in pD3* was converted to a Bel I site by Eco Rl cleavage, incubation 
with SI nuclease, and subsequent ligation with Bcl I linkers. DNA was prepared from a positively identified 
colony, and the 1.9 kb XhoI-PstI fragment containing the altered restriction site was prepared via agarose 
gel electrophoresis. In a second modification. Bcl l-cleaved pD3 was ligated with kinased Eco RI-Bcl t 

50 adaptors (constructed from oligonucleotides ZC 525. ^ GGAATTCT^'; and ZC526, ®*GATCAGAATTCC^) in 
order to generate an Eco RI site as the position for inserting a gene into the expression vector. Positive 
colonies were identified by restriction endonuclease analysis, and DNA from this was used to isolate a 2.3 
kb Xhol-PstI fragment containing the modifed restriction site. The two above-described DNA fragments were 
incubated together with T4 DNA ligase, transformed into E. coli HB101 and positive colonies were identified 

55 by restriction analysis. A preparation of such DNA. termed~pDX. was made (Figure 12). This DNA was 
cleaved with Eco Rl and subsequentiy incubated with calf-intestinal phosphatase. Th purified DNA was 
then incubated with T4 DNA ligase and the Factor VII Eco RI fragment from p VI 12463, or with the Factor VII 
Eco Rl cDNA fragment derived from pVlI2397. The resultant clones wer designated FVII(2463)/pDX and 
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FVII(565 + 2463)/pDX. respectively (Figure 12). After transformation into E. coli JM83 and subsequent 
identification by restriction enzyme analysis, plasmid ON A preparations we7e~made and checked by 
extensive restriction endonuclease digestion. The plasmids FVII(2463)/pDX and FVII(565 + 2463)/pDX have 
been deposited with American Type Culture Collection and have been assigned accession numbers 40206* 
and 40205. respectively. 

FVII(2463ypDX and FVI 1(565 + 2463ypDX (10 ug each) were each transfected, along with 10 ug salmon 
sperm canrier DNA, into either BHK tk'tsIS cells (Floros et al.. Exper. Cell Res. 132: 215-223. 1981) or COS 
cells, using standard calcium-phosphate precipitation. Following transfection, theTells were cultured in the 
appropriate media containing 5 ug/ml vitamin K for two days. At this time, the supernatants were assayed 
for ELISA-positive material, using a monoclonal antibody directed against Factor VII. Both FVII(2463)/pDX 
and FV!l(565 + 2463)/pDX directed the production of Factor VII polypeptide which was detected in COS cell 
supernatants, and Factor VII from FVII(565 + 2463)pDX was detected in BHK cell supernatant. Sham- 
transfected BHK cells or COS cells did not yield detectable levels of Factor VII (Table 5). " • * 



TABLE 5 

ELISA Positive 



DNA 


Cell 
Line 


Cell 
Number 


Material (ng/ml 
Culture Medium) 


FVII (2463)/pDX 


COS 


2 X 10^ 


15 


FVIK 565+2463 )/pDX 


COS 


2 X 106 


12 


Control 


COS 


2 X 10^ 


<2 


FVII(2463)/pDX 


BHK 


9 X 10^ 


62 


FVII (565+2463 )/pDX 


BHK 


9 X 10^ 


6 


Control 


BHK 


9 X 10^ 


<2 
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Transient expression of Factor VII and also tested in several other cell lines, listed in Table 6. 



Name 

1. Rat Hep I 

2. Rat Hep II 
3- TCMK 

4. Human lung 



5 * Human 
hepatoma 



6. Hep G2 



7. Mouse liver 



TABLE 6 

Reference 

Description (ATTC # ) 

Rat hepatoma H4-II-E-C3 CRL 1600 

Rat hepatoma H4-II-E CRL 1548 

Mouse Kidney, SV40 virus , , 

transformed, TCMK-1 CCL 139 

SV40 virus transformed 

WI-38 VA13, subline 2RA CCL 75.1 

Human liver adenocarcinoma 

SK-HEP-1 HTB-52 

Human hepatoma, dev. by 

Barbara Knowles/Wistar HTB 8065 
Institute 

NCTC clone 1469 CC 29.1 



8. COS 

9. BHK 

10. 293 

11. DUKX 



SV40-transf ormed CV-1 
(monkey) cells 

Baby hamster kidney 
BHK-21 (C-13) 

Human embryonic 
Kidney/Ad transformed 



CHO-DHFR 



sens 



CRL 1650 



CCL 10 



CRL 1573 

(Urlaub & 
Chasin, 
PKAS (USA) 
77: 4216- 
4220, 1980) 



Cells were cotransfected with 10 ug of either FVII(2463)/pDX or i=VII(565 + 2463)/pDX together with 1 
ug of a plasmid comprising the chloramphenicol acetyl transacetylase gene (to permit identification of 
cotransfected cells) and 10 ug of salmon sperm DNA. Mock transfected cells were used as controls. Spent 
media were assayed by ELISA after six days. Results are given in Table 7. 
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TABLE 7 

Cell 
Number ELI S A 



Sample 


Cell Line 


Plasmid 


X 10"^ 


( ng/ml ) 


1. 


Rat Hep I 


Mock 


11.6 


2.4 


2. 


Rat Hep I 


FVII (565+2463 )/pDX 


7.0 


2 


3, 


Rat Hep I 


FVII(2463)/pDX 


7.0 


<2 


4. 


Rat Hep 2 


Mock 


13.0 


<2 


5. 


Rat Hep 2 


FVII ( 56 5 +2 4 6 3 ) /pDX 


16.4 


<2 


6. 


Rat Hep 2 


FVII ( 2463 )/pDX 


14.6 


<2 


7. 


TCMK 


Mock 


18.8 


<2 


8. 


TCMK 


FVII ( 56 5+24 63 ) /pDX 


9.8 


<2 


9. 


TCMK 


FVII(2463)/pDX 


12.8 


<2 


10. 


Human Lung 


Mock 


7.2 


<2 


11. 


Human Lung 


FVII (565+2463 )/pDX 


3.4 


16.5 



30 



35 



40 



45 



50 
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12. 


Human 


Lung 


FVII(2463)/pDX 


3.4 


12.2 


5 


13. 


Buman 


Hepatoma 


MocJc 


13.0 


<2 




14. 


Buman 


Hepatoma 


PVII (565+2463 )/pDX 


11.0 


3.5 


10 


15. 


Buman 


Hepatoma 


PVIl(2463)/pDX 


6.0 


3.0 


16. 


HepG2 




Mock 


6.8 


21 




17. 


BepG2 




FVII (565+2463 )/pDX 


6.0 


45.5 


IS 


18. 


HepG2 




FVII(2463)/pDX 


6.0 


28 




19. 


Mouse 


Liver 


Mock 


3.8 


<2 


20 


20. 


Mouse 


Liver 


FVII (565+2463 )/pDX 


4.0 


<2 




21. 


Mouse 


Liver 


FVII (2463 )/pDX 


3.6 


<2 




22. 


COS 




Mock 


5.6 


<2 


25 


23. 


COS 




FVII(565+2463)/pDX 


5.6 


15.5 




24. 


COS 




FVII(2463)/pDX 


4.4 


14.5 


30 


25. 


BHK tk 


s 

"tl3 

A 


Mock 


3.0 


<2 


26. 


BHK tk 


A 


FVII(565+2463)/pDX 


5.0 


25 




27. 


BKH tk 


A 


FVII (2463 )/pDX 


4.0 


22.5 


35 


28. 


293 




Mock 


5.8 


<2 




29. 


293 




FVII(565+2463)/pDX 


6.2 


94 


40 


30. 


293 




FVII ( 2463 >/pDX 


8.2 


100 




31. 


DUKX 




Hock 


11.6 


<2 




32. 


DUKX 




FVII ( 565+2463 )/pDX 


13.0 


<2 


45 


33. 


DUKX 




FVII ( 2463 )/pDX 


13.6 


<2 



FV!l(2463)/pDX (10 ug) or FVM(565 + 2463)/pDX (10 ug) was co-transfected with 10 ug of salmon sperm 
50 DNA and 1 ug of a plasmid encoding the resistant form of dihydrofolate reductase (Simonsen and Levinson, 
Proc. Natl. Acad. Sci. USA 80: 2495-2499, 1983) in a mammalian expression vector, into BHK tR'tslS cells! 
After two days, the cells were split 1:14 and placed into selective media containing either 250 nM or 1000 
nM methotrexate (MTX) and 5 ug/ml vitamin K (phytadione, Merck). After two weeks, colonies were isolated 
and grown to 50-90% confluency. The supernatant media were then assayed for Factor VII polypeptide by 
55 ELISA. Of the 25 positive clones, 22 were anaJyzed further. The cells were plated at 5 x 10* (Group I) or 1 x 
105 (Group II) in 10 cm dishes containing 5 ug/ml vitamin K, and either 250 nM or 1000 nM methotrexate. 
Rve days later, the faster growing clones (designated by an asterisk in Table 8) were split 1 :2. then after 24 
hours, th media were changed on all plates. Twenty-three hours (Group I) or 20 hours (Group II) later, 
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supernatant m dia were harvested and cell counts were taken for each clone. The m dia wer assayed 
both by ELISA and by the one-stage clotting assay. Results are shown in Table 8, 



TABLE 8 
GROUP I - 23-hour assay 



Clone 


Cell 
Count 

Plasmid XIO'S 


ELISA 

(pg/ 

cell/ 
day) 


ELISA 
(no/ 
ml) 


Clot- 
ting . 
(na/ 
ml) 


% 

Active 


B4 


-Al* 


PVII (565+2463 )/pDX 


2 


6.5 


130 ■ 


206 


.158 


B4 


-Bl* 


FVII ( 565+2463 ) /pDX 


27 


1.9 


513 


360 


70 


B4 


-CI 


PVII ( 565+2463 )/pDX 


16 


2.5 


393 


480 


122 


B4 


-C2* 


FVII (565+2463 )/pDX 


9 


<0.2 


<20 


21 




B4 


-C3 


FVII ( 565+2463 )/pDX 


52 


1.5 


800 


910 


114 


B4 


-Dl* 


PVII ( 565+2463 )/pDX 


27 


2.0 


553 


570 


103 


B4 


-D2* 


FVII (565+246 3 )/pDX 


13 


1.2 


150 


154 


103 


B4 


-El 


FVII ( 565+2463 )/pDX 


39 


2.2 


870 


1160 


133 


B4 


-E2* 


FVII(565+2463)/pDX 


8 


2.5 


205 


240 


117 


B4 


-E3 


FVII (565+246 3 )/pDX 


23 


1.2 


275 


320 


116 


B4 


-E4 


FVII (565+246 3 )/pDX 


31 


1.3 


410 


300 


73 


B3 


-5.3* 


FVII (2463 )/pDX 


5 


8.2 


410 


290 


70 



GROUP II - 20-hour assay 



Clone 


Plasmid 


Cell 
Count 
XlO-5 


ELISA 

(pg/ 

cell/ 
day) 


ELISA 
(ng/ 
ml) 


Clot- 
ting 
(ng/ 
ml) 


% 

Active 


B3-2. 


2 


FVII ( 2463 )/pDX 


41 


2.5 


1043 


500 


48 


B3-2. 


3 


FVII ( 2463 )/pDX 


19 


3.0 


580 


610 


105 


B3-3. 


2 


FVII ( 2463 )/pDX 


13 


1.5 


197 


216 


110 


B3-4. 


2 


FVII ( 2463 )/pDX 


41 


1.8 


760 


620 


82 


B3-5. 


1 


FVII ( 2463 )/pDX 


14 


3.3 


460 


400 


87 


B3-5. 


2 


FVII(2463)/pDX 


9 


2.7 


257 


184 


72 


B6-D 




FVII(2463)/pDX 


54 


3.0 


1700 


780 


46 


B6-E 




FVII ( 2463 )/pDX 


101 


1.6 


1640 


970 


59 


B6-G 




FVII ( 2463 )/pDX 


31 


5.7 


1853 


1080 


58 


B6-M 




FVII ( 2463 )/pDX 


75 


2.2 


1743 


940 


54 
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B, Expression of Factor VII genomic-cDNA hybrid. 

An expression vector containing genomic sequences representing the Factor VII genomic 5*-terminus 
and cONA sequences from the Factor Vll gene S'-tenminus was prepared as follows. Three subclones of the 

5 genomic plasmid 7 m1 were used to reconstruct the 5'-terminus: 7Bam. 7SD and 7SE. Plasmid 7Bam is a 
3.6 Kb EcoRI-Bam HI fragment containing exon la, subcloned into pUCl2, A 0.7 Kb EcoRI-Xbal fragment, 
which contains exon la. was isolated from this sut>cIon6 and is designated fragment a. Plasmid 7SD is a 3.7 
Kb SstI fragment containing exon 1b. subcloned into pUCIS. An exon lb-containing 3.1 Kb Xbal-SstI 
fragment was isolated from this subclone and is designated fragment b. Plasmid 7SE is a 3.9 Kb SstI 

w fragment containing exons 2-4. subcloned into M13mp 19. An Sstl-Bgl 11 (0.6 Kb) fragment containing the 5* 
part of exon 2 was gel isolated and is designated fragment c. The remainder of the 3*-Factor Vll cDNA 
(fragment d) was otrtained as a 2 Kb Bglll-EcoRI fragment from pUCVII2463. Fragments a-d were ligated 
with EcoRI-cleaved and calf intestinal-phosphatased pDX, and then transformed into E. coli JM83 or HB101. 
Positive colonies were identified by restriction endonuclease analysis, and plasmid IDNA^as prepared* from 

15 these colonies. 

For expression of Factor Vll. the plasmid DNA is co-transfected into BHK or COS cells as described 
above. Transfected cells are cultured in vitamin K-containIng medium for 2 days, and the medium is 
assayed for Factor Vll by ELISA. 

From the foregoing it will be appreciated that, although specific embodiments of the invention have 
20 been described herein for purposes of illustration, various modiftcations may be made. Accordingly, the 
invention is not to be limited except by the appended claims. 

The various strains of E. Coli used in the foregoing Examples were the personal choice of the inventors. 
Persons skilled in this art will appreciate that other suitable strains of E, Coli could be substituted (for 
example if the person already has in his pwssession samples of other suitable E. Coll strains). 
25 Transformants ATCC 53067 and 53068 were deposited with the Americaln Type Culture Collection on 
28 March 1985 and plasmids ATCC 40205 and 40206 were deposited with the American Type Culture 
Collection on 25 November 1 985. all four deposits being made In accordance with the Budapest Treaty. 

The features disclosed in the foregoing description, in the following claims and/or in the accompanying 
drawings may. both separately and in any combination thereof, be material for realising the invention in 
30 diverse forms thereof. 

Claims 

1. A DNA construct containing a nucleotide sequence encoding human factor Vll having an amino acid 
35 sequence as shown in Rgure lb. 

2. The DNA construct of Claim 1 wherein said nucleotide sequence also encodes a leader peptide. 

3. The DNA construct of Claim 1 wherein said nucleotide sequence includes a synthesized double- 
40 Stranded oligonucleotide. 

4. The DNA construct of Claim 3 wherein said synthesized double-stranded oligonucleotide codes for the 
amino-terminal portion of Factor Vll, 

45 5. The DNA construct of Claim 1 wherein at least a portion of said nucleotide sequence is derived from a 
cDNA clone or a genomic clone of Factor VII. 

6. The DNA construct of Claim 1 comprising a first nucleotide sequence derived from a cDNA or a 
genomic clone of Factor Vll. joined to a second nucleotide sequence positioned downstream of said 

so first sequence, said second sequence derived from a cDNA clone of Factor Vll. the joined sequences 
coding for a protein which upon activation has Factor Vila biological activity for blood coagulation. 

7. The DNA construct of Claim 1 wherein said nucleotide sequence comprises the cDNA sequence of 
Figure lb. from bp 36 to bp 1433. 

55 

8. The DNA construct of Claim 1 wherein said nucleotide sequence comprises the cDNA sequence of 
Rgure lb, from bp 36 to bp 99. followed downstream by the sequence from bp 166 to bp 1433. 
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9. The DNA construct of Claim 1 wherein said nucleotide sequence codes for the amino acid sequence of 
Pigur 1 b from alanine, amino acid number 1 , to proline, amino acid number 406. 

10. A recombinant plasmid capable of integration in mammalian host cell DNA, said plasmid includina a 
promoter followed downstream by a nucleotide sequence according to any of the ivi^us SSmsTto 
9. said nucleotide sequence being followed downstream by a polyadenylation signaJ 

11. Mammalian cells transfected with a recombinant plasmid according to Claim 10. 

° Vlir<^mpSing-'^"'^"^ ^ "^"""^ biological activity for blood coagulation mediated by Factor 

establishing a mammalian host cell which contains a DNA construct containing a nucleotide 
sequence encoding human factor VII; ^ "uoioouae 

growing said mammalian host ceil in an appropriate medium which contains vitamin K;' ' 
' cell 'SlT"^ ''"^"'^ ^"'^ ^ construct produced by said mammalian host 

bloo^^glltetion'' ^ generate a protein which has Factor Vila biological activity for 

. 13. The method of Claim 12. including amplification of the DNA construct by cotransfection of the host cell 
w.th a gene encoding dihydrofolate reductase, wherein the appropriate medium comprises methotrex" 

14. The mettiod of Claim 12 wherein said protein product is activated by reacting the protein with a 
S.d ftl^Vn ^""^ selected from the group consisting of Factor Xlla. Factor IXa, kallikrein. Factor Xa. 

15. A phamiaceutical preparation for the treatment of bleeding disorders containing a protein havina'an 
ammo acid sequence as shown in Figure lb and free of contaminating human proteins 

16. A method of producing a protein having biological activity for blood coagulation mediated by Factor 
^T'^.'^u^ cof^Prises growing in an appropriate medium which contains vitamin K an 
estaWished mammalian host cell which contains a DNA construct containing a nucleotide sequence 
encoding human factor VII. isolating the protein product encoded by said DNA construct produced bv 
Scud mammalian host cell and activating said protein product to generate a protein which has Factor 
Vila biological activity for blood coagulation. rauioi 

17. The method of Claim 12. 13, 14 or 16 wherein said host cell is a non-hepatic cell. 

ia A method of preparing a pharmaceutical composition for the treatment of bleeding disorders which 
mettiod compnses preparing a pharmaceutical composition containing a protein produced by ttie 
mettiod of any one of Qaims 12. 13. 14. 16 and 17. 

PatentansprUche 

1 



2. 

3. 



DNA-Aufbau. der eine Nukleotidsequenz enttiSlt. welche menschlichen Faktor VII kodiert. mit einer 
Aminosauresequenz wie in Figur lb gezeigt 

DNA-Aufbau nach Anspruch 1, bet dem die Nukleotidsequenz auch ein Leader-Peptit kodiert. 
SliL-'Sid"^^^^ synthettsiertes doppe.strSngiges 

Z:e:rA;iS"^;lo^^ ^^-^"'^'^'^^ Ooppelstr^ngige O.igonukleotid fOr den 

°!^^'J^l^^" Anspruch 1. bei dem w nigstens ein Abschnitf der NukI ottdsequenz von inem 
cDNA-Won Oder einem genomischen Klon des Faktors VII abgeleitet ist 
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6. DNA-Aufbau nach Anspnjch 1, welcher eine erste Nukleotidsequenz aufweist. die von einem cDNA- 
oder einem genomischen Klon des Faktors VII abgeleitet ist, angebunden an eine zweite Nukleotidse- 
quenz. die stromabwarts der ersten Sequenz angeordnet ist, wobei die zweite Sequenz von einem 
cDNA-K!cn des Faktors V!! abge!sitet ist, wobei die verbundsnsn Sequenzen fOr ein Protein kodisren, 
das nach der Aktivierung die biologtsche Aktivitat des Faktors Vila fOr Blutgerinnung hat. 

7. DNA-Aufbau nach Anspruch 1, bei dem die Nukleotidsequenz eine cDNA-Sequenz der Figur 1b umfaBt, 
von Basenpaar 36 bis Basenpaar 1 433. 

a DNA-Aufbau nach Anspruch 1, bei dem die Nukleotidsequenz die cDNA-Sequenz der Figur lb 
aufweist. von Basenpaar 36 bis Basenpaar 99, stromaufwSrts gefolgt von der Sequenz von Basenpaar 
166 bis Basenpaar 1433. 

9. DNA-Aufbau nach Anspruch 1 . bei dem die Nukleotidsequenz fur die Aminosauresequenz der -R^ur 1 b 
von Alanin, AminosSure Nr. 1 , bis Prolin. AminosSure Nr. 406 kodiert. 

10, Rekombinantes Plasmid, geeignet zur Integration in SSugetier-Wirtszellen-DNA, wobei das Plasmid 
einen Promoter umfafit. der stromabwarts von einer Nukleotidsequenz gema8 einem der vorangehen- 
den AnsprOche 1 bis 9 gefolgt ist, wobei die Nukleotidsequenz stromabwarts von einem Polyadenilie- 
rungssignal gefolgt ist. 

11, SSugetierzellen, transfiziert mit einem rekombinanten Plasmid nach Anspruch 10. 

12. Verfahren zum Produzieren eines Proteins mit biologischer Aktivitat fOr Blutgerinnung. vermittelt durch 
Faktor Vila, umfassend: 

Einrichten einer SSugetier-Wirtszelle, welche einen DNA-Aufbau enthSlt. der eine menschlischen Faktor 
VII kodierende Nukleotidsequenz enthalt; 

Zuchten der Saugetier-Wirtszelle in einem geeigneten Medium, welches Vitamin K enthalt: 

Isolieren des Proteinproduktes. kodiert von dem DNA-Aufbau, das von der Saugetier-Wirtszelle produ- 

ziert worden ist; und 

Aktivieren des Proteinproduktes, um ein Protein zu erzeugen, das eine biologische Aktivitat fOr 
Blutgerinnung von Faktor Vila hat. 

ia Verfahren nach Anspruch 12. einschlieBIich der Verstarkung des DNA-Aufbaus durch Kotransfektion der 
Wirtszelle mit einem Gen. das die Hydrofolat-Reduktase kodiert, wobei das geeignete Medium 
Methotrexat aufweist. 

14. Verfahren nach Anspruch 12, bei dem das Proteinprodukt aktiviert wird, indem das Protein mrt einem 
proteolytischen Enzym reagiert wird, das aus der Gruppe bestehend aus Faktor Xlla, Faktor IXa. 
Kalikrein. Faktor Xa und Thrombin ausgewShlt ist 

15. Pharmazeutische Praparation fUr die Behandlung von Blutungserkrankungen. welche ein Protein mit 
einer Aminosauresequenz wie in Figur lb gezeigt und frei von kontaminierenden menschlichen 
Proteinen enth§lt. 

16. Verfahren zum Produzieren eines Proteins mit biologischer Aktivitat fur Blutgerinnung, vermittelt durch 
Faktor Vila, bei dem der ProzeB das ZOchten einer eingerichteten Saugetier-Wirtszelle. welche einen 
DNA-Aufbau, der eine menschlichen Faktor VII kodierende Nukleotidsequenz enthalt, enthalt. in einem 
geeigneten Medium, das Vitamin K enthalt, das Isolieren das von diesem DNA-Aufbau kodierten 
Proteinproduktes. das von der SSugetier- Wirtszelle produziert worden ist. und das Aktivieren des 
Proteinproduktes, um ein Protein zu erzeugen. das die biologische Aktivitat fur Blutgerinnung von 
Faktor Vila hat, aufweist. 

17. Verfahren nach Anspruch 12, 13, 14 oder 16, bei dem die Zelle eine nicht-hepatisch Zelle ist 

18. Verfahren zum Praparieren einer pharmazeutischen Zusammensetzung fur die Behandlung von Blu- 
tungserkrankungen. wobei das Verfahren das Praparieren einer pharmazeutischen zusammensetzung 
umfaBt di ein durch das Verfahren nach einem der Anspruch 12, 13, 14, 16 und 17 produziertes 
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Protein enthalt. 



10 



Revendications 

1. Produit de recombinaison d'ADN contenant une sequence de nucleotides codant pour le facteur 
humain VII, ayant une sequence d'amino-acides telle que representee sur la figure 1 b. 

2. Produit de recombinaison d'ADN suivant la revendication 1. dans lequel la sequence de nucleotides 
code aussi pour un peptide leader. 

3. Produit de reconnbinaison d'ADN suivant la revendication 1, dans lequel la sequence de nucleotides 
renfemne un oligonucleotide double brin synthetise. 

•> *- * 

4. Produit de recombinaison d'ADN suivant la revendication 3. dans lequel roligonucleotide double brin 
15 synthetise code pour la portion terminale amino du facteur VII. 

5. Produit de recombinaison d'ADN suivant la revendication 1, dans lequel au moins une portion de la 
sequence de nucleotides est derivee d'un clone d'ADNc ou d'un clone genomique du facteur VII. 

20 6. Produit de recombinaison d'ADN suivant la revendication 1. comprenant une premiere sequence de 
nucleotides derivee d'un clone d'ADNc ou d'un clone genomique du facteur VII. nee h une seconde 
sequence de nucleotides disposee en aval de la premilsre sequence, la seconde sequence etant 
denvee d'un clone d'ADNc du facteur VII. les sequences liees codant pour une proteine qui. par 
activation, possfede I'actitive biologique du facteur Vila pour la coagulation du sang. 

7. Produit de recombinaison d'ADN suivant la revendication 1. dans lequel la sequence de nucleotides 
comprend la sequence d'ADNc de la figure lb. de la paire de bases 36 k la paire de bases 1433. • 

8. Produit de recombinaison d'ADN suivant la revendication 1. dans lequel la sequence de nucleotides 
comprend la sequence d'ADNc de la figure lb. de la paire de bases 36 h la paire de bases 99, suivie 
en aval de la sequence allant de la paire de bases 166 2i la paire de bases 1433. 

9. Produit de recombinaison d'ADN suivant la revendication 1. dans lequel la sequence de nucleotides 
code pour la sequence d'amino-acides de la figure lb allant de ramino-acide. alanine, numero 1 k 
Tamino-acide. proline, numero 406. 

10. Plasmide recombinant susceptible d'integration dans KADN d'une cellule-h6te de mammiffere, ledit 
plasmide comprenant un promoteur suivi en aval d*une sequence de nucleotides suivant Tune 
quelconque des revendications 1 k 9. ladite sequence de nucleotides etant suivie en aval d'un signal 
de polyadenylation. 

11- Cellules de mammifSre transfectees avec un plasmide recombinant suivant la revendication 10. 

12. Precede de production d'une proteine douee d'activite biologique pour la coagulation du sang sous la 
mediation du facteur Vila, comprenant : 

retablrssement d'une cellule-h3te de mammiffere qui contient un produit de recombinaison d'ADN 
comportant une sequence de nucleotides codant pour le facteur humain VII ; 

la croissance de ladite cellule-hote de mammtfdre dans un milieu approprie qui contient de la 
vitamine K : 

Tisolement du produit proteique code par le produit de recombinaison d'ADN eiabore par la cellule- 
h6te de mammiffere ; et 

I'activation dudit produit proteique pour engendrer une proteine qui a Tactivite biologique du facteur 
Vila pour la coagulation du sang. 

ia Precede suivant la revendication 12. comportant Tamplification du produit de recombinaison d'ADN par 
co-transfection d la c llule-hote avec un g6ne codant pour la dihydrofolate-reductase. oO le milieu 
approprie comprend du methotrexate. 
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14. Proced^ suivant la revendication 12. dans lequel le produrt proteique est active par reaction de la 
prot^ine avec un enzyme prot^olytique choisi dans te groupe comprenant le facteur Xlla, le facteur IXa. 
la kallicrdine, le facteur Xa et la thrombine. 

15. Preparation pharmaceutique destinee au traitement de troubles h^morragiques. contenant une proteine 
ayant une sequence d'amino-acides telle que representee sur la figure lb et ddpourvue de prot^ines 
humaines contaminantes. 

16. Proc^d^ de production d'une protiine douee d*activit§ biologique pour la coagulation du sang sous la 
mediation du facteur Vila, precede qui comprend la croissance. dans un milieu appropri^ contenant de 
la vitamine K, d'une cellule-hote de mammiffere ^tablie qui contient un produit de recombinaison d*ADN 
contenant une sequence de nucleotides codant pour le facteur humain VII. Tisolement du produit 
proteique cod§ par le produit de recombinaison d'ADN ^labor^ par ladite cellule-hote de mammif§re et 
I'activation du produit proteique pour engendrer une proteine qui a Tactivit^ biologique du facteur Vila 
pour la coagulation du sang. 

17. Proc666 suivant la revendication 12, 13. 14 ou 16, dans lequel la cellule-h6te est une cellule non 
hepatique. 

18. Proc^d^ de preparation d'une composition destinee au traitement de troubles hemorragiques, proc^d^ 
qui comprend la preparation d'une composition pharmaceutique contenant une proteine produtte par le 
precede suivant Tune quelconque des revendications 12. 13, 14, 16 et 17. 
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FIGo lA 



EcoRIa 2k 39 54 

GAATTCC GG TCC AGG ACG AAG CTG TTC TCG ATT TCT TAG AGT CAT GGG CAC CAG 
Arg Thr tys Leu Phe Trp lie Scr Tyr Ser Asp Gly Asp G)n 

69 99 
TGT GCC TCA AGT CCA TGC CAG AAT GGG 6GC TCC TGC AAG GAC CAG CTC CAG TCC 
Cys Ala Ser Ser Pro Cys Gin Asn Gly Gly Ser Cys Lys Asp Gin Leu Gin Ser 

11* 129 \kk 159 

TAT ATC TGC TTC TGC CTC CCT GCC TTC GAG GGC CGG AAC TGT GAG ACG CAC AAG 
Tyr lie Cys Phe Cys Leu Pro Ala Phe Glu Gly Arg Asn Cys Glu Thr His Lys 

17*1 189 20* Pst Xa 

CAT GAC CAG CTG ATC TGT GTG AAC GAG AAC GGC GGC TGT GAG CAG TA C TGC AGT 
Asp Asp Gin Leu lie Cys Val Asn Glu Asp G)y Gly Gys Gio Gin Tyr Cys Ser 

219 23* 2*9 26* 

GAC CAC ACG GGC ACC AAG CGC TCC TGT CGG TGC CAC GAG GGG TAC TCT CTG CTG 

Asp His Thr Gly Thr Lys Arg Ser Cys Arg Cys His Glu Gly Tyr Scr Leu Leu 

279 29* 309 32** 

GCA GAC GGG GTG TCC TGC ACA CCC ACA GTT CAA TAT CCA TGT GGA AAA ATA CCT 
Ala Asp Gly Val Scr Cys Thr Pro Thr Val Glu Tyr Pro Cys Gly Lys lie Pro 

Xba I 339 35* 369 

A TT CTA CAA AAA ACA AAT GCC AGC AAA CCC CAA GGC CGA ATT GTG GGG GGC AAG 
Me Leu Glu Lys Arg Asn Ala Ser Lys Pro Gin Gly Arg lie Vat Gly Gly Lys 

38* 399 *1* *29 

GTG TGC CCC AAA GGG GAG TGT CCA TGG CAG GTC CTG TTZ TTG GTG AAT GGA GCT 
Val Cys Pro Lys Gly Glu Cys Pro Trp Gin Val Leu Leu Leu Vat Asn Gly Ala 

*** *59 *7* 

CAG TTG TGT GGG GGG ACC CTG ATC AAC ACC ATC TGG GTG GTC TCC GCG GCC CAC 
Gin Leu Cys Gly Gly Thr Leu lie Asn Thr lie Trp Val Vat Ser Ala Ala His 

*89 50* 519 53* 

TGT TTC GAC AAA ATC AAG AAC TGG AGG AAC CTG ATC GCG GTG CTG GGC GAG CAC 

Cys Phe Asp Lys lie Lys Asn Trp Arg Asn Leu lie Ala Val Leu Gly Glu His 

5*9 56* 579 59* 

GAC CTC AGC GAG CAC GAC GGG GAT GAG CAG AGC CGG CGG GTG GCG CAG GTC ATC 
Asp Leu Ser Glu His Asp Gly Asp Glu Gin Ser Arg Arg Val Ala Gin Val lie 

609 Sma I 62* 639 

ATC CCC ACC ACG TAC G TC CCG GG C ACC ACC AAC CAC GAC ATC GCC CTG CTC CGC 
lie Pro Ser Thr Tyr Val Pro Gly Thr Thr Asn His Asp lie Ala Leu Leu Arg 

65* 669 68* 699 

CTG CAC CAG CCC GTC GTC CTC ACT CAC CAT GTG CTG CCC CTC TGC CTG CCC CAA 
Leu His Gin Pro Vat Val Leu Thr Asp His Val Vat Pro Leu Cys Leu Pro Glu 
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71* 729 7** 



CGC 
Arg 


ACG 
Thr 


TTC 
Phe 


TCT 
Ser 


GAG 
Glu 


ACG 
Arg 


ACG 
Thr 


CTG 
Leu 


GCC TTC GTG 
Ala Phe Val 


CGC 
Arg 


TTC 
Phe 


TCA 
Ser 


TT6 
Leu 


CTC 
Val 


AGC 
Ser 


GGC 
Cly 


759 
TOQ 
Trp 


GGC 

Cly 


CAG 
Gin 


CTG 
Leu 


CTG 
Leu 


77* 
OAC 
Asp 


CGT 
Arg 


Nar I 789 
GGC GCC ACG GCC 
Gly Ala Thr Ala 


CTG 
Leu 


GAG 
Glu 


CTC 
-leu 


ATG 
Het 


804 

6TC CTC 
Val Leu 


AAC 
Asn 


GTG 
Val 


CCC 
Pro 


819 
CGG 
Arg 


CTG 
Leu 


ATG 

net 


ACC 
Thr 


CAG 
Gin 


834 
GAC 
Asp 


TCC 
Cys 


Pst lb 
CTG CAG 
Leu Gin 


CAG 
Gin 


849 
TCA 
Ser 


CGG 
Arg 


AAG 

Lys 


GTG 
Val 


GGA 
Gly 


864 
CAC 
Asp 


TCC 
Ser 


CCA 
Pro 


AAT 
Asn 


ATC 
lie 


879 
ACG 
Thr 


GAG 
Glu 


TAC 
Tyr 


ATG 
Met 


TTC 
Phe 


894 

TGT GCC 
Cys Ala 


GGC 
Gly 


TAC 
Tyr 


TC6 
Ser 


909 
GAT 
Asp 


GGC 
Gly 


AGC 
Ser 


AAG 
Lys 


GAC 
Asp 


92h 
TCC 
Ser 


TCC 
Cys 


AAG 

Lys 


GCG 
Cly 


GAC 
Asp 


939 
AGT 
Ser 


GGA 
Gly 


GCC 
Gly 


CCA CAT 
Pro Hts 


9S4 
GCC 
Ala 


ACC 
Thr 


CAC 
His 


TAC 
Tyr 


CGG 
Arg 


969 
GGC 
Gly 


ACG 
Thr 


T6G 
Trp 


TAC 
Tyr 


CTG 
Leu 


984 
ACG 
Thr 


GGC 
Gly 


ATC 
Me 


6TC 
Val 


AGC 
Ser 


999 
TGG 
Trp 


GGC CAG 
Gly Gin 


GGC 
Gly 


1014 
TGC GCA 
Cys Ala 


ACC 
Thr 


GTG 
Val 


GCC 
Gly 


CAC 
His 



1029 1044 1059 Taql 1074 

TTT CGG GTG TAC ACC AGG CTC TCC CAG TAC ATC GAG TGG CTG CAA AAG CTC ATG 
Phe Gly Val Tyr Thr Arg Val Ser Gin Tyr lie Glu Trp Leu Gin Lys Leu Met 

1089 1104 1119 1138 

CGC TCA GAG CCA CGC CCA GGA GTC CTC CTG CGA GCC CCA TTT CCC TAG CCCAGCACCC 
Arg Ser Glu Pro Arg Pro Gly Val Leu'Leu Arg Ala Pro Phe Pro 

Pstic 

1148 1158 1168 1178 1188 II98 1208 

CTGGCCTGTC GACA6AAAGC CAAGCCTGCG TCGAACTGTC CT66CACCAA ATCCCATATA TTCTTCT6CA 



1218 1228 1238 1248 1258 1268 1278 

GTTAATGGGG TAGAGGAGGG CATGGGAGG6 AGGGAGAGGT GGGGAGGGAG ACAGAGACAG AAACAGAGAG 



1288 1298 1308 I3I8 1328 1338 1348 

AGACAGAGAC AGAGAGAGAC TGAGGGAGAG ACTCTGAGGA CCATGCACAG AGACTdAAAC AGACTCCAAC 



1358 1368 1378 1368 1398 1408 1438 

ATTCAAAGAG ACTAATAGAG ACACAGAGAT GGAATAGAAA AGATGAGAGG CAGAGGCAGA CAGGCGCTGG 



1428 1438 1448 1458 1468 1478 1468 

ACAGAGCGGC AGGGCAGTGC CAAGGTTGTC CTCGACGCAG ACAGCCCACC TCACCCTCCT TACCTCCCTT 
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U58 1*508 1518 1528 1538 'l548 1558 

CACCCAAGCC CCACCTCCAC CT6ATCTCCT GCCCCTCAGG CTGCTGCTCT GCCTTCATTC CTCCACA'CAC * 



1568 1578 1588 1598 I608 I6I8 I628 

TAGAGGCATG ACACACATGG ATGCACACAC ACACACGCCA TGCACACACA CACACATATG CACACACACC 

1638 I6C18 1658 1668 1678 1688 1698 

GATGCACACA CAGATGGTCA CACAGAGTAC CCAAACACAC CGATGCACAC GCACATAGAG ATATGCACAC 



1708 1718 1728 1738 1748 1758 1768 

ACAGATGCAC ACACAGATAT ACACATGGAG TGCACGCACA TGCCAATCCA CGCACACATC AGTGCACACG 



1778 1788 1798 1808 1818 1828 I838 

GATGCACACA CATATGCACA CACCGAT6TG CGCACACACA GATATGCACA CACATGGATG AGCACACACA 



1848 1858 1868 1878 1888 1898 1908 

CACCAAGTGC GCACACACAC CGATGTACAC ACAGATGCAC ACACAGATGC ACACACACCG ATGCTGACTC 



1918 1928 1938 19^8 1958 1968 1978 

CATGTGT6CT 6TCCTCTCAA GGCGCTT6TT TAGCTCTCAC TTTTCT6CTT CTTATCCATT ATCATCTTCA 



1988 1998 2008 2018 2028 2038 2048 

CTTCAGACAA TTCAGAA6CA TCACCAT6CA T6GTGGCGAA TGCCCCCAAA CTCTCCCCCA AATGTATTTC 



2058 2068 207B 2088 2098 2108 2118 

TCCCTTCGCT GGCTCCCGG6 CTGCACAGAC TATTCCCCAC CTGCTTCCCA 6CTTCACAAT AAAC6GCTCC 



2128 2138 2148 2158 2168 EcoRIb 

GTCTCCTCCC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAGGAATTC 
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FIGolB 

MetValSerGlnAlaLeuArgLeuLeu 
TCAACAGGCAGGGGCAGCACTGCAGAGATTTCATCATGGTCTCCCAGGCCCTCAGGCTCCTC 
10 20 30 40 50 60 

-50 -40 i 

CysLeuLeuLeuGlyLeuGlnGlyCysLeuAlaAlaGlyGlyValAlaLysAlaSerGlyGly 
TGCCTTCTGCTTGGGCTTCAGGGCTGCCTGGCTGCAGGCGGGGTCGCTAAGGCCTCAGGAGGA 
70 80 90 100 110 120 

-30 -20 i 

GluThrArgAspMetProTrpLysProGlyProHiaArgValPheValThrGlnGluGlu 
G AAACACGGG ACATGCCGTGG AAGCCGGGGCCTC ACAG AGTCTTCGT AACCC AGG AGG AA 
130 140 150 160 170 180 

-10 -1 +1 +10 

AlaHisGlyValLeuHisArgArgArgArgAlaAsnAlaPheLeuGluGluLeuArgPro 
GCCCACGGCGTCCTGCACCGGCGCCGGCGCGCCAACGCGTTCCTGGAGGAGCTGCGGCCG 
190 200 210 220 230 240 

+20 +30 
GlySerLeuGluArgGluCysLysGluGluGlnCysSerPheGluGluAlaArgGluIle 
GGCTCCCTGGAGAGGGAGTGCAAGGAGGAGCAGTGCTCCTTCGAGGAGGCCCGGGAGATC 
250 260 270 280 290 300 

+40 +50 
PheLysAspAlaGluArgThrLysLeuPheTrpIleSerTyrSerAspGlyAspGlnCys 
TT.CAAGGACGCGGAGAGGACGAAGCTGTTCTGGATTTCTTACAGTGATGGGGACCAGTGT 
310 320 330 340 350 360 

+60 +70 
AlaSerSerProCysGlnAsnGlyGlySerCysLysAspGlnLeuGlnSerTyrlleCys 
GCCTCAAGTCCATGCCAGAATGGGGGCTCCTGCAAGGACCAGCTCCAGTCCTATATCTGC 
370 380 390 400 410 420 

+80 +90 
PheCysLeuProAlaPheGluGlyArgAsnCysGluThrHisLysAspAspGlnLeuIle 
TTCTGCCrcCCTGCCTTCGAGGGCCGGAACTGTGAGACGCACAAGGATGACCAGCTGATC 
430 440 450 460 470 480 

+100 +110 
CysValAsnGluAsnGlyGlyCysGluGlnTyrCysSerAspHisThrGlyThrLysArg 
TGTGTGAACGAGAACGGCGGCTGTGAGCAGTACTGCAGTGACCACACGGGCACCAAGCX3C 
490 500 510 520 530 540 

+120 +130 
SerCysArgCysHisGluGlyTyrSerLeuLeuAlaAspGlyValSerCysThrProThr 
TCCTGTCGGTGCCACGAGGGGTACTCTCTGCTGGCAGACGGGGTGTCCTGCACACCCACA 
550 560 570 580 590 600 

+140 +150 
ValGluTyrProCysGlyLysIleProIleLeuGluLysArgAanAlaSerLysProGln 
GTTG AAT AT CC AT GTGG AAAAAT ACCT ATTCT AG AAAAAAG AAATGCC AGC AAACCCCAA 
610 620 630 640 650 660 
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+160 +170 
GlyArglleValGlyGlyLysValCysProLysGlyGluCysProTrpGlnValLeuLeu 
GGCCGAATTGTGGGGGGCAAGGTGTGCCCCAAAGGGGAGTGTCCATGGCAGGTCCTGTTG 
670 680 690 700 710 720 

+180 +190 
LeuValAsnGlyAlaGlnLeuCysGlyGlyThrLeuIleAsnThrlleTrpValValSer 
TTGGTGAATGGAGCTCAGTTGTGTGGGGGGACCCTGATCAACACCATCTGGGTGGTCTCC 
730 740 750 760 770 780 

+200 . +210 

AlaAlaHisCysPheAspLysIleLysAsnTrpArgAsnLeuIleAlaValLeuGlyGlu 
GCGGCCCACTGTTTCGACAAAATCAAGAACTGGAGGAACCTGATCGCGGTGCTGGGCGAG 
790 880 810 820 830 840 

+220 +230 
HisAspLeuSerGluHisAspGlyAspGluGlnSerArgArgValAlaGlnValllelle 
CACGACCTCAGCGAGCACGACGGGGATGAGCAGAGCCGGCGGGTGGCGCAGGTCATCATC 
850 860 870 880 890 900 

+240 +250 
ProSerThrTyrValProGlyThrThrAsnHisAspIleAlalieuLeuArgLeuHisGln 
CCCAGCACGTACGTCCCGGGCACCACCAACCACGACATCGCGCTGCTCCGCCTGCACCAG 
910 920 930 940 950 960 

+260 +270 
ProValValLeuThrAspHisValValProLeuCysLeuProGluArgThrPheSerGlu 
CCCGTGGTCCTCACTGACCATGTGGTGCCCCTCTGCCTGCCCGAACGGACGTTCTCTGAG 
970 980 990 1000 1010 1020 

+280 +290 
ArgThrLeuAlaPheValArgPheSerLeuValSerGlyTrpGlyGlnLeuLeuAspArg 
AGGACGCTGGCCTTCGTGCGCTTCTCATTGGTCAGCGGCTGGGGCCAGCTGCTGGACCGT 
1030 1040 1050 1060 1070 1080 

+300 +310 
GlyAlaThrAlaLeuGluLeuMetValLeuAsnValProArgLeuMetThrGlnAspCys 
GGCGCCACGGCCCTGGAGCTCATGGTCCTCAACGTGCCCCGGCTGATGACCCAGGACTGC 
1090 1100 1110 1120 11'30 1140 

+320 +330 
LeuGlnGlnSer ArgLysValGlyAspSerProAsnlleThrGluTyrMetPheCysAla 
CTGCAGCAGTCACGGAAGGTGGGAGACTCCCCAAATATCACGGAGTACATGTTCTGTGCC 
1150 1160 1170 1180 1190 1200 

+340 +350 
GlyTyrSer AspGlySerLysAspSerCysLysGlyAspSerGlyGlyProHisAlaThr 
GGCTACTCGGATGGCAGCAAGGACTCCTGCAAGGGGGACAGTGGAGGCCCACATGCCACC 
1210 1220 1230 1240 1250 1260 

+360 +370 
HisTyr ArgGlyThrTrpTyrLeuThrGlylleValSerTrpGlyGlnGlyCysAlaThr 
CACTACCGGGGCACGTGGTACCTGACGGGCATCGTCAGCTGGGGCCAGGGCTGCGCAACC 
1270 1280 1290 1300 1310 1320 
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+380 +390 

ValGlyaisPheGlyVaiTyrThrArgValSerGlnTyrlleGluTrpI.euGlnLvsLeii 
GTGGGCCACTTTGGGGTGTACACCAGGGTCTCCCAGTACATCGAGTGGCTGCAAAAGCTC 
1330 1340 1350 1360 1370 1380 

+400 +406 

MetArgSerGluProArgProGlyValLeuLeuArgAlaProPhePro*** 

ATGCGCrCAGAGCCACGCCCAGGAGTCCTCCTGCGAGCCCCATTTCCCTAGCCCAGCAGC 
1390 1400 1410 1420 1430 1440 

CCTGGCCTGTGGAGAGAAAGCCAAGGCTGCGTCGAACTGTCCTGGCACCAAATCCCATAT 
1450 1460 1470 1480 1490 1500 - • 

ATTCTTCTGCAGTTAATGGGGTAGAGGAGGGCATGGGAGGGAGGGAGAGGTGGGGAGGGA 
1510 1520 1530 1540 1550 1560 

GACAGAGACAGAAACAGAGAGAGACAGAGACAGAGAGAGACTGAGGGAGAGACTCTGAGG 
1570 1580 1590 1600 1610 1620 

ACATGGAGAGAGACTCAAAGAGACTCCAAGftrPCAAAGAGACTAATAGAGACACAGAGAT 
1630 1640 1650 1660 1670 1680 

GGAATAGAAAAGATGAGAGGCAGAGGCAGACAGGC6CTGGACAGAGGGGCAGGGGAGTGC 
1690 1700 1710 1720 1730 1740 

CAAGGTTGTCCTGGAGGCAGACAGCCCAGCTGAGCCTCCTTACCTCCCTTCAGCCAAGCr 
1750 1760 1770 1780 1790 1800 

CCACCTGCACGTGATCTGCTGGCCCTCAGGCTGCTGCTCTGCCTTCATTGCTGGAGACAG 
1810 1820 1830 1840 1850 1860 

'^^?^?°^'^'''*^^^'^^*^^^''*5*'''*5*=*CACACACACACGCCAATGCACACACACAGAGATA 
1870 1880 1890 1900 1910 1920 

TGCACACACACGGATGCACACACAGATGGTCACACAGAGATACGCAAACACACCGATGCA 
1930 1940 1950 I960 1970 1980 

CACGCACATAGAGATATGCACACACAGATGCACACACAGATATACACATGGATGCACX3CA 
1990 2000 2010 2020 2030 2040 

CATGCCAATGCACGCACACATCAGTGCACACGGATGCACAGAGATATGCACACACCGATG 
2050 2060 2070 2080 2090 2100 

TGCGCACACACAGATATGCACACACATGGATGAGCACACACACACCAAGTGCGCACACAC 
2110 2120 2130 2140 2150 2160 

ACCGATGTACACACACAGATGCACACACAGATGCACACACACCGATGCTGACTCCATGTG 
2170 2180 2190 2200 2210 2220 

TGCTGTCCTCTGAAGGCGGTTGTTTAGCTCTCACTTTTCTGGTTCTTATCCATTATCATC 
2230 2240 2250 2260 2270 2280 

TTCACTTCAGACAATTCAGAAGCATCACCATGCATGGTGGCGAATGCCCCCAAACTCTCC 
2290 2300 2310 2320 2330 2340 

*^???^'''^^'^'"^*^^^'^'^^*5^<^^SCCGGGCTGCACAGACTATTCCCCACCTGCTT 
2350 2360 2370 2380 2390 2400 
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CCCAGCTTCACAATAAACGGCTGCGTCTCCTCCGCACACCTGTGGTGCCTGCCACCC 
2410 2420 2430 2240 2450 2460 

AAAAAAAAAAAAAAAAAA 
2470 2480 
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FIG. 7 

21 36 
GGATCC ATG CAG CGC GTG AAC ATG ATC ATG GCA GAA TCA CCA GGC 
MET Gin Arg Val Asn MET lie MET Ala Glu Ser Pro Gly 

66 81 
CTC ATC ACC ATC TGC CVT TTA GGA TAT CTA CTC AGT GCT GAA TGT 
Leu lie Thr lie Cys Leu Leu Gly Tyr Leu Leu Ser Ala Glu Cys 

96 111 126 

ACA GTT TTT CTT GAT CAT GAA AAC GCC AAC AAA ATT CTG AAT CGG 
Thr Val Phe Leu Asp His Glu Asn Ala Asn Lys lie Leu Asn Arg 

141 156 171 

CCA AAG AGG TAT AAT TCA GGT AAA TTG GAA GAG TTT GTT CAA GGG 
Pro Lys Arg Tyr Asn Ser Gly Lys Leu Glu Glu Phe Val Gin Gly 

186 201 216 

AAC CTT GAG AGA GAA TGT ATG GAA GAA AAG TGT AGT TTT GAA GAA 
Asn Leu Glu Arg Glu Cys MET Glu Glu Lys Cys Ser Phe Glu Glu 

231 246 261 

GCA CGA GAA GTT TTT GAA AAC ACT GAA AGA ACA AAG CTG TTC TGG 
Ala Arg Glu Val Phe Glu Asn Thr Glu Arg Thr Lys Leu Phe Trp 

276 291 306 

ATT TCT TAC AGT GAT GGG GAC CAG TGT GCC TCA AGT CCA TGC CAG 
He Ser Tyr Ser Asp Gly Asp Gin Cys Ala Ser Ser Pro Cys Gin 

321 336 351 

AAT GGG GGC TCC TGC AAG GAC CAG CTC CAG TCC TAT ATC TGC TTC 
Asn Gly Gly Ser Cys Lys Asp Gin Leu Gin Ser Tyr He Cys Phe 

366 381 396 

TGC CTC CCT GCC TTC GAG GGC CGG AAC TGT GAG ACG CAC AAG GAT 
Cys Leu Pro Ala Phe Glu Gly Arg Asn Cys Glu Thr His Lys Asp 

411 426 441 

GAC CAG CTG ATC TGT GTG AAC GAG AAC GGC GGC TGT GAG CAG TAC 
Asp Glu Leu He Cys Val Asn Glu Asn Gly Gly Cys Glu Gin Tyr 

456 471 486 

TGC AGT GAC CAC ACG GGC ACC AAG CGC TCC TGT CGG TGC CAC GAG 
Cys Ser Asp His Thr Gly Thr Lys Arg Ser Cys Arg Cys His Glu 

501 516 531 

GGG TAC TCT CTG CTG GCA GAC GGG GTG TCC TGC ACA CCC ACA GTT 
Gly Tyr Ser Leu Leu Ala Asp Gly Val Ser Cys Thr Pro Thr Val 

546 561 576 

GAA TAT CCA TCT GGA AAA ATA CCT ATT CTA GAA AAA AGA AAT GCC 
Glu Tyr Pro Cys Gly Lys He Pro He Leu Glu Lys Arg Asn Ala 

591 606 621 

AGC AAA CCC CAA GGC CGA ATT GTG GGG GGC AAG GTG TGC CCC AAA 
Ser Lys Pro Gin Gly Arg He Val Gly Gly Lys Val Cys Pro Lys 
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636 651 666 

GGG GAG TGT CCA TGG CAG GTC CTG TTG TTG GTG AAT GGA GCT CAG 
Gly Glu Cys Pro Trp Gin Val Leu Leu Leu Val Asn Gly Ala Gin 

681 696 711 

TTG TGT GGG GGG ACC CTG ATC AAC ACC ATC TGG GTG GTC TCC GCG 
Leu Cys Gly Gly Thr Leu He Asn Thr He Trp Val Val Ser Ala 

726 741 756 

GCC CAC TGT TTC GAC AAA ATC AAG AAC TGG AGG AAC CTG ATC GCG 
Ala His Cys Phe Asp Lys He Lys Asn Trp Arg Asn Leu He Ala 

771 786 801 

GTG CTG GGC GAG CAC GAC CTC AGC GAG CAC GAC GGG GAT GAG CAG 
Val Leu Gly Glu His Asp Leu Ser Glu His Asp Gly Asp Glu Gin 

816 831 846 

AGC CGG CGG GTG GCG CAG GTC ATC ATC CCC AGC ACG TAC GTC CCG 
Ser Arg Arg Val Ala Gin Val He He Pro Ser Thr Tyr Val Pro 

861 876 891 

GGC ACC ACC AAC CAC GAC ATC GCG CTG CTC CGC CTG CAC CAG CCC 
Gly Thr Thr Asn His Asp He Ala Leu Leu Arg Leu His Gin Pro 

906 921 936 

GTG GTC CTC ACT GAC CAT GTG GTG CCC CTC TGC CTG CCC GAA CGG 
Val Val Leu Thr Asp His Val Val Pro Leu Cys Leu Pro Glu Arg 

951 966 981 

ACG TTC TCT GAG AGG ACG CTG GCC TTC GTG CGC TTC TCA TTG GTC 
Thr Phe Ser Glu Arg Thr Leu Ala Phe Val Arg Phe Ser Leu Val 

996 1011 1026 

AGC GGC TGG GGC CAG CTG CTG GAC CGT GGC GCC ACG GCC CTG GAG 
Ser Gly Trp Gly Gin Leu Leu Asp Arg Gly Ala Thr Ala Leu Glu 

1041 1056 1071 

CTC ATG GTC CTC AAC GTG CCC CGG CTG ATG ACC CAG GAC TGC CTG 
Leu MET Val Leu Asn Val Pro Arg Leu MET Thr Gin Asp Cys Leu 

1086 1101 1116 

CAG CAG TCA CGG AAG GTG GGA GAC TCC CCA AAT ATC ACG GAC TAC 
Gin Gin Ser Arg Lys Val Gly Asp Ser Pro Asn He Thr Glu Tyr 

1131 1146 1161 

ATG TTC TGT GCC GGC TAC TCG GAT GGC AGC AAG GAC TCC TGC AAG 
MET Phe Cys Ala Gly Tyr Ser Asp Gly Ser Lys Asp Ser Cys Lys 

1176 1191 1206 

GGG GAC AGT GGA GGC CCA CAT GCC ACC CAC TAC CGG GGC ACG TGG 
Gly Asp Ser Gly Gly Pro His Ala Thr His Tyr Arg Gly Thr Trp 

1221 1236 1251 

TAC CTG ACG GGC ATC GTC AGC TGG GGC CAG GGC TGC GCA ACC GTG 
Tyr Leu Thr gly He Val Ser Trp Gly Gin Gly Cys Ala Thr Val 
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1266 1281 1296 

GGC CAC TTT GGG GTG TAC ACC AGG GTC TCC CAG TAC ATC GAG TGG 
Gly hxs Phe Gly Val Tyr Thr Arg Val Ser Gin Tyr lie ??p 

1311 1326 2.2*1 

CTG CAA AAG CTC ATG CGC TCA GAG CCA CGC CCA GGA GTC CTC CTr 
Leu Gin Lys Leu MET Arg Ser Glu Pro Arg Pro Gly Val S5 Su 

pfS CCCAGcicc? CTGGCciG?? GAGAGaJiIg?. , . 

1408 1418 1428 1438 144fl 

CAAGGCTGCG TCGAACTGTC CTGGCACCAA ATCCCATATA TTCTTCTGCA 

1458 1468 1478 1488 149a 

GTTAATGGGG TAGAGGAGGG CATGGGAGGG AGGGAGAGGT GGGGAGGGAG 

1508 1518 1528 1538 1548 

ACAGAGACAG AAACAGAGAG AGACAGAGAC AGAGAGAGAC TGAGGGAGAG 

1558 1568 1578 1588 159a 

ACTCTGAGGA CCATGGAGAG AGACTCAAAG AGACTCCAAG ATTCAAAGAG 

1608 1618 1628 1638 1648 

ACTAATAGAG ACACAGAGAT GGAATAGAAA AGATGaGAGG CAGAGGCAGA 

1658 1668 1678 1688 1698 

CAGGCGCTGG ACAGAGGGGC AGGGGAGTGC CAAGGTTGTC CTGGAGGCAG 

1708 1718 1728 1738 174B 

ACAGCCCAGC TGAGCCTCCT TACCTCCCTT CAGCCAAGCC CCACCTGCAC 

1758 1768 1778 1788 1798 

GTGATCTGCT GGCCCTCAGG CTGCTGCTCT GCCTTCATTG CTGGAGACAG 

1808 1818 1828 1838 1848 

TAGAGGCATG ACACACATGG ATGCACACAC ACACACGCCA TGCACACACA 

1858 1868 1878 1888 1898 

CAGAGATATG CACACACACG GATGCACACA CAGATGGICA CACAGAGTAC 

1908 1918 1928 1938 194B 

GCAAACACAC CGATGCACAC GCACATAGAG ATATGCACAC ACAGATGCAC 
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1958 1968 1978 1988 



1998 

ACACAGATAT ACACATGGAG TGCACGCACA TGCCAATGCA CGCACACATC 



2008 2018 2028 2038 



— 2048 

AGTGCACACG GATGCACAGA GATATGCACA CACCGATGTG CGCACACACA 

2058 2068 2078 2088 2098 

GATATGCACA CACATGGATG AGCACACACA CACCAAGTGC GCACACACAC 

2108 2118 2128 2138 2148 

CGATGTACAC ACAGATGCAC ACACAGATGC ACACACACCG ATGCTGACTC 

2158 2168 2178 2188 2198 

CATGTGTGCT GTCCTCTGAA GGCGGTTGTT TAGCTCTCAC TTTTCTGGTT 

2208 2218 2228 2238 2248 

CTTATCCATT ATCATCTTCA CTTCAGACAA TTCAGAAGCA TCACCATGCA 

2258 2268 2278 2288 2298 

TGGTGGCGAA TGCCCCCAAA CTCTCCCCCA AATGTATTTC TCCCTTCGCT 

2308 2318 2328 2338 2348 

GGGTGCCGGG CTGCACAGAC TATTCCCCAC CTGCTTCCCA GCTTCACAAT 



2358 2368 2378 2388 2398 

AAACGGCTGC GTCTCCTCGC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

2408 2418 2428 2438 

AAAAAAAAAA AAGGAATTCG AGCTCGGTAC CCGGGGATCC 
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FIG. 12 
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